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The  higher  infinite  refers  to  the  lofty  reaches  of  the  infinite  cardinalities  of  set  the- 
ory as  charted  out  by  large  cardinal  hypotheses.  These  hypotheses  posit  cardinals 
that  prescribe  their  own  transcendence  over  smaller  cardinals  and  provide  a super- 
structure for  the  analysis  of  strong  propositions.  As  such  they  are  the  rightful  heirs 
to  the  two  main  legacies  of  Georg  Cantor,  founder  of  set  theory:  the  extension 
of  number  into  the  infinite  and  the  investigation  of  definable  sets  of  reals.  The 
investigation  of  large  cardinal  hypotheses  is  indeed  a mainstream  of  modem  set 
theory,  and  they  have  been  found  to  play  a crucial  role  in  the  study  of  definable 
sets  of  reals,  in  particular  their  Lebesgue  measurability.  Although  formulated  at 
various  stages  in  the  development  of  set  theory  and  with  different  incentives,  the 
hypotheses  were  found  to  form  a linear  hierarchy  reaching  up  to  an  inconsistent 
extension  of  motivating  concepts.  All  known  set-theoretic  propositions  have  been 
gauged  in  this  hierarchy  in  terms  of  consistency  strength,  and  the  emerging  struc- 
ture of  implications  provides  a remarkably  rich,  detailed  and  coherent  picture  of 
the  strongest  propositions  of  mathematics  as  embedded  in  set  theory. 

The  first  of  a projected  multi-volume  series,  this  text  provides  a compre- 
hensive account  of  the  theory  of  large  cardinals  from  its  beginnings  through  the 
developments  of  the  early  1970’s  and  several  of  the  direct  outgrowths  leading  to 
the  frontiers  of  current  research.  A further  volume  will  round  out  the  picture  of 
those  frontiers  with  a wide  range  of  forcing  consistency  results  and  aspects  of 
inner  model  theory.  A genetic  account  through  historical  progression  is  adopted, 
both  because  it  provides  the  most  coherent  exposition  of  the  mathematics  and 
because  it  holds  the  key  to  any  epistemological  concerns.  With  hindsight  how- 
ever the  exposition  is  inevitably  Whiggish,  in  that  the  consequential  avenues  are 
pursued  and  the  most  elegant  or  accessible  expositions  given.  Each  section  is  a 
modular  unit,  and  later  sections  often  describe  how  concepts  discussed  in  earlier 
sections  inspired  the  next  advance.  With  speculations  and  open  questions  pro- 
vided throughout,  the  reader  should  not  only  come  to  appreciate  the  scope  and 
significance  of  the  overall  enterprise  but  also  become  prepared  to  pursue  research 
in  several  specific  areas. 

In  what  follows  a historical  and  conceptual  overview  is  given,  one  that  serves 
to  embed  the  sections  of  the  text  into  a larger  framework.  In  an  appendix  larger  and 
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more  discursive  issues  that  may  be  raised  by  the  investigation  of  large  cardinals 
are  taken  up.  See  Hallett  [84],  Lavine  [94],  Moore  [82],  and  Fraenkel-Bar-Hillel- 
Levy  [73]  for  more  on  the  development  of  set  theory;  several  themes  that  are  only 
broached  here  are  substantiated  in  at  least  one  of  these  sources. 

The  Beginnings  of  Set  Theory 

Set  theory  had  its  beginnings  in  the  great  19th  Century  transformation  of  math- 
ematics that  featured  the  arithmetization  of  analysis  and  a new  engagement  with 
abstraction  and  generalization.  Very  much  new  mathematics  growing  out  of  old, 
the  subject  did  not  spring  Athena-like  from  the  head  of  Cantor  but  in  a gradual 
process  out  of  problems  in  mathematical  analysis.  In  the  wake  of  the  founding  of 
the  calculus  by  Isaac  Newton  and  Gottfried  Leibniz  the  function  concept  had  been 
steadily  extended  from  analytic  expressions  toward  arbitrary  correspondences,  in 
the  course  of  which  the  emphasis  had  shifted  away  from  the  continuum  taken  as  a 
whole  to  its  construal  as  a collection  of  points,  the  real  numbers.  The  first  major 
expansion  had  been  inspired  by  the  explorations  of  Leonhard  Euler  and  featured 
the  infusion  of  infinite  series  methods  and  the  analysis  of  physical  phenomena, 
particularly  the  vibrating  string. 

Working  out  of  this  tradition  the  young  Cantor  in  the  early  1870’s  established 
uniqueness  theorems  for  trigonometric  series  in  terms  of  their  points  of  conver- 
gence, theorems  based  on  collections  of  reals  defined  through  a limit  operation 
iterable  into  the  infinite.  In  a crucial  conceptual  move  Cantor  began  to  investigate 
such  collections  and  infinitary  enumerations  for  their  own  sake,  and  this  led  first 
to  basic  concepts  in  the  study  of  sets  of  reals  and  then  to  the  formulation  of  the 
transfinite  numbers.  Set  theory  was  bom  on  that  December  1873  day  when  Cantor 
established  that  the  reals  are  uncountable,  i.e.  there  is  no  one-to-one  correspon- 
dence between  the  reals  and  the  natural  numbers,  and  in  the  next  decades  was  to 
blossom  through  the  prodigious  progress  made  by  him  in  the  theory  of  ordinal 
and  cardinal  numbers.  But  a synthesis  of  the  reals  as  representing  the  continuum 
and  the  new  numbers  as  representing  well-orderings  eluded  him:  Cantor  could  not 
establish  the  Continuum  Hypothesis,  that  the  cardinality  2K°  of  the  set  of  reals  is 
the  least  uncountable  cardinality  N| , part  of  his  problem  being  that  he  could  not 
define  a well-ordering  of  the  reals. 

Cantor  came  to  view  the  finite  and  the  transfinite  as  all  of  a piece,  simi- 
larly comprehendable  within  mathematics,  and  delimited  by  what  he  termed  the 
“Absolute”  which  he  associated  mathematically  with  the  class  of  all  ordinals  and 
metaphysically  with  God.  As  part  of  this  realist  picture  Cantor  viewed  sets,  at  least 
until  the  early  1890’s,  as  inherently  structured  with  a well-ordering  of  their  mem- 
bers. Ordinal  and  cardinal  numbers  resulted  from  successive  abstraction,  from  a 
set  x to  its  ordertype  x and  then  to  its  cardinality  x. 

But  such  a structured  view  served  to  accentuate  a growing  stress  among 
mathematicians,  who  were  already  exercised  by  two  related  issues:  whether  in- 
finite collections  can  be  investigated  within  mathematics  at  all  and  how  far  the 
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function  concept  is  to  be  extended.  The  positive  use  of  an  arbitrary  function  having 
been  made  explicit,  there  was  open  controversy  after  Ernst  Zermelo  [04]  formu- 
lated what  he  soon  called  the  Axiom  of  Choice  and  established  his  Well-Ordering 
Theorem,  that  the  axiom  implies  every  set  can  be  well-ordered. 

With  axiomatization  assuming  a general  methodological  role  in  mathematics 
Zermelo  [08a]  soon  published  the  first  axiomatization  of  set  theory.  But  as  with 
Cantor’s  work  the  move  was  in  response  to  mathematical  pressure  for  a new 
context:  Beyond  the  stated  purpose  of  securing  set  theory  from  paradox  Zermelo’s 
main  motive  was  apparently  to  buttress  his  Well-Ordering  Theorem  by  making 
explicit  its  underlying  set  existence  assumptions.  In  the  process,  he  shifted  the 
focus  away  from  the  transfinite  numbers  to  an  abstract  view  of  sets  structured 
solely  by  e and  simple  operations.  Extracted  from  a specific  proof  (for  the  Well- 
Ordering  Theorem  in  his  [08])  Zermelo’s  axioms  had  the  advantages  of  simplicity 
and  open-endedness.  The  generative  set  formation  axioms,  especially  Power  Set, 
were  to  lead  to  Zermelo’s  later  adumbration  [30]  of  the  cumulative  hierarchy  view 
of  sets,  and  the  vagueness  of  the  definit  property  in  the  Separation  Axiom  was  to 
invite  Thoralf  Skolem’s  [23]  proposal  to  base  it  on  first-order  logic. 

Skolem’s  move  was  in  the  wake  of  a mounting  initiative,  one  that  was  to 
expand  set  theory  with  new  viewpoints  and  techniques  as  well  as  to  invest  it 
with  a larger  foundational  significance.  Gottlob  Frege  is  regarded  as  the  greatest 
philosopher  of  logic  since  Aristotle  for  developing  his  1879  Begriffsschrift  (quan- 
tificational  logic),  establishing  a logical  foundation  for  arithmetic,  and  generally 
stimulating  the  analytic  tradition  in  philosophy.  The  architect  of  that  tradition 
is  Bertrand  Russell  who  in  his  early  years,  influenced  by  Frege  and  Giuseppe 
Peano,  wanted  to  found  all  of  mathematics  on  the  certainty  of  logic.  The  vaulting 
expression  of  that  ambition  was  the  1910-3  three  volume  Principia  Mathematica 
by  Alfred  Whitehead  and  Russell.  But  Russell  was  exercised  by  his  well-known 
paradox,  one  which  led  to  the  tottering  of  Frege’s  mature  formal  system.  As  a 
result  Principia  was  encased  in  a complex  logical  system  of  different  types  and 
intensional  predications  ultimately  breaking  under  his  Axiom  of  Reducibility,  a 
fearful  symmetry  imposed  by  an  artful  dodger. 

It  remained  for  David  Hilbert  to  shift  the  ground  and  establish  mathematical 
logic  as  a field  of  mathematics.  Russell’s  philosophical  disposition  precluded  his 
axiomatizing  logic,  but  Hilbert  brought  it  under  scrutiny  as  he  did  Euclidean  geom- 
etry by  establishing  an  axiomatic  context  and  raising  the  crucial  questions  of  con- 
sistency and  later,  completeness.  This  largely  syntactic  approach  was  soon  given 
a superstructure  when  in  response  to  intuitionistic  criticism  by  Luitzen  Brouwer 
and  Hermann  Weyl,  Hilbert  developed  proof  theory  and  proposed  his  program  of 
establishing  the  consistency  of  classical  mathematics  with  his  metamathematics. 
These  issues  gained  currency  because  of  Hilbert’s  preeminence,  just  as  mathe- 
matics in  the  large  was  expanded  by  his  reliance  on  non-constructive  proofs  and 
transcendental  methods.  Through  this  expansion  the  Axiom  of  Choice  became  a 
mathematical  necessity,  particularly  because  of  maximality  arguments  in  algebra, 
and  arbitrary  functions  became  implicitly  accepted  in  the  growing  investigation  of 
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higher  function  spaces.  With  the  increasing  emphasis  on  frameworks  and  struc- 
tures, the  power  set  operation  became  incorporated  into  mathematics. 

Throughout,  Zermelian  set  theory  grew  as  the  mathematical  repository  of 
foundational  concerns  and  initiatives.  As  the  first  result  of  his  axiomatic  set 
theory  Zermelo  [08a]  himself  put  the  Russell  paradox  argument  to  use  to  show 
that  for  any  set  x there  is  a set  jCj  such  that  y <£  x (so  that  there  is  no  universal 
set).  Friedrich  Hartogs  [15]  in  effect  converted  the  Burali-Forti  paradox  into  the 
existence  for  any  set  x of  a well-orderable  set  y not  injectible  into  x.  Analyzing 
the  Zermelo  [08]  proof  Kazimierz  Kuratowski  [21]  provided  that  definition  of 
the  ordered  pair,  antithetical  to  Russell’s  type-ridden  theory,  which  became  the 
standard  way  to  reduce  the  theory  of  relations  to  sets.  And  then  Skolem  [23] 
made  his  proposal  of  rendering  Zermelo’s  Separation  Axiom  in  terms  of  properties 
expressible  in  first-order  logic. 

More  than  that,  Skolem  intended  for  set  theory  to  be  based  on  first-order 
logic  with  e construed  syntactically  and  without  a privileged  interpretation.  This 
becomes  clear  in  his  application  of  the  Lowenheim-Skolem  theorem  to  get  the 
Skolem  paradox:  the  existence  of  countable  models  of  set  theory  although  it  entails 
the  existence  of  uncountable  sets.  Ironically,  Skolem  intended  by  this  means  to 
deflate  the  possibility  of  set  theory  becoming  a foundation  for  mathematics,  but 
following  Kurt  Gddel’s  work  Skolem’s  syntactical  approach  to  set  theory  came 
to  be  accepted.  And  again  the  ways  of  paradox  were  absorbed  into  set  theory, 
as  the  Lowenheim-Skolem  theorem  came  to  play  an  important  internal  role  when 
semantic  methods  were  ushered  in  by  Alfred  Tarski. 

Skolem  [23]  also  and  Abraham  Fraenkel  [21,22]  independently  proposed  the 
addition  of  the  Replacement  Axiom  to  Zermelo’s  list,  and  this  axiom  soon  figured 
in  a counter-reformation  of  sorts.  John  von  Neumann  [23]  introduced  the  ordinals 
(transitive  sets  well-ordered  by  e)  and  showed  that  every  well-ordering  is  iso- 
morphic to  an  ordinal,  thereby  restoring  Cantor’s  transfinite  numbers  as  sets.  No 
longer  were  the  numbers  abstractions,  but  in  the  new  formulation  became  incor- 
porated into  the  Zermelian  framework  of  sets  built  up  by  e and  simple  operations. 
Von  Neumann’s  particular  approach  to  axiomatization  fostered  the  liberal  use  of 
proper  classes  in  set  theory  and  brought  Replacement  into  prominence  through  its 
role  in  definitions  by  transfinite  recursion. 

With  these  developments  before  him  Zermelo  [30]  presented  his  final  ax- 
iomatization of  set  theory,  incorporating  Replacement  and  also  Foundation.  This 
axiomatization  was  in  second-order  terms,  allowed  urelements,  and  eschewed  the 
Axiom  of  Infinity,  but  shorn  of  these  features  it  became  the  standard  Zermelo- 
Fraenkel  (ZFC)  one  when  recast  in  the  soon  to  emerge  terms  of  first-order  logic. 
The  Foundation  Axiom  had  been  prefigured  as  a restricting  possibility  by  Dmitry 
Mirimanov  [17],  Skolem  [23],  and  von  Neumann  [25].  Zermelo  offered  a syn- 
thetic view  of  a succession  of  natural  models  for  set  theory,  each  a member  of 
a next,  essentially  realizing  that  Foundation  ranks  the  sets  in  these  models  into  a 
cumulative  hierarchy.  In  current  terms  the  axiom  stratifies  the  formal  universe  V 
of  sets  as  Ua  Va,  where  Vq  is  0,  Va+i  is  the  power  set  of  Va,  and  Vg  for  limit 
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ordinals  8 is  the  union  of  the  V^’s  for  a < 8.  In  a notable  inversion  this  iterative 
conception  came  to  be  accepted  after  Godel’s  later  advocacy  as  a heuristic  for 
motivating  the  axioms  of  set  theory  generally,  its  open-endedness  moreover  pro- 
moting a principle  of  tolerance  for  motivating  new  hypotheses  mediating  toward 
Cantor’s  Absolute.  Foundation  is  the  one  axiom  unnecessary  for  the  recasting 
of  mathematics  in  set-theoretic  terms,  but  it  came  to  be  the  salient  feature  that 
distinguishes  structural  investigations  specific  to  set  theory.  Indeed,  it  can  be 
fairly  said  that  modem  set  theory  is  the  study  of  well-foundedness,  the  Cantorian 
well-ordering  doctrines  adapted  to  the  Zermelian  combinatorial  conception  of  sets. 

In  the  1930’s  Godel’s  incisive  analyses  brought  about  a transformation  of 
mathematical  logic  based  on  new  initiatives  for  mathematical  elucidation.  The 
main  source  was  of  course  his  Incompleteness  Theorem  [31],  which  led  to  the 
undecidability  of  validity  for  first-order  logic  and  the  development  of  recursion 
theory.  But  starting  an  undercurrent,  the  earlier  Completeness  Theorem  [30]  clar- 
ified the  distinction  between  the  semantics  and  syntax  of  first-order  logic  and 
secured  its  key  instrumental  property,  compactness.  Then  Tarski  [33, 35]  set  out 
his  schematic  definition  of  truth  in  set-theoretic  terms,  exercising  philosophers  to  a 
surprising  extent  ever  since.  The  groundwork  had  been  laid  for  the  development  of 
model  theory,  and  set  theory  was  to  be  considerably  enriched  since  the  1950’s  by 
model-theoretic  techniques.  First-order  logic  came  to  be  accepted  as  the  canonical 
language  because  of  its  mathematical  possibilities,  Skolem’s  earlier  suggestion  for 
set  theory  taken  up  generally,  and  higher-order  logics  became  downgraded  as  the 
workings  of  the  power  set  operation  in  disguise. 

So  enriched  and  fortified  by  axioms,  results,  and  techniques  axiomatic  set  the- 
ory was  launched  on  its  independent  course  by  Godel’s  construction  of  L [38, 39] 
leading  to  the  relative  consistency  of  the  Axiom  of  Choice  and  the  Continuum 
Hypothesis.  Synthesizing  what  came  before,  Godel  built  on  the  von  Neumann 
ordinals  as  sustained  by  Replacement  to  formulate  a relative  Zermelian  universe 
of  sets  based  on  logical  definability,  a universe  imbued  with  a Cantorian  sense  of 
order. 

Large  Cardinals 

If  the  foregoing  in  brief  (and  with  interpretative  twists)  is  the  high  tradition  of  set 
theory  from  Cantor  to  Godel,  large  cardinals  are  the  trustees  of  older  traditions  in 
direct  line  from  Cantor’s  original  investigations  of  definable  sets  of  reals  and  of 
the  transfinite  numbers.  Before  taking  up  the  more  continuous  tradition  having  to 
do  directly  with  the  transfinite  the  other  tradition  is  described,  one  that  was  to  be 
revitalized  in  the  1960’s  by  major  new  initiatives. 

Descriptive  set  theory  is  the  definability  theory  of  the  continuum,  the  study  of 
the  structural  properties  of  definable  sets  of  reals.  In  his  most  substantive  approach 
to  the  Continuum  Hypothesis  Cantor  had  structured  the  problem  via  perfect  sets 
and  established  that  the  closed  sets  have  the  perfect  set  property  (11.3).  Related 
were  his  contributions  to  measure  theory,  a theory  that  led  to  the  Borel  sets  and 
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of  course  to  Lebesgue  measure.  The  major  incentives  of  descriptive  set  theory 
have  been  to  approach  sets  of  reals  through  definability  as  Cantor  had  done,  and  to 
investigate  the  extent  of  the  regularity  properties,  of  which  Lebesgue  measurability 
and  Cantor’s  perfect  set  property  are  two.  In  a seminal  paper  Henri  Lebesgue  [05] 
provided  the  first  hierarchy  for  the  Borel  sets  and  applied  Cantor’s  diagonalization 
argument  to  show  that  the  hierarchy  is  both  proper  and  does  not  exhaust  the 
definable  sets  of  reals.  The  subject  really  began  with  Mikhail  Suslin’s  discovery 
[17]  of  the  analytic  sets  and  fundamental  results  about  this  first  level  of  the  later 
projective  hierarchy.  The  subsequent  development  by  Nikolai  Luzin,  Waclaw 
Sierpinski,  and  their  collaborators  featured  tree  representations  of  sets  of  reals, 
and  it  was  through  this  opening  that  well-founded  relations  entered  mathematical 
practice,  the  later  tradition  leading  to  Foundation  and  the  iterative  conception  being 
quite  separate  and  motivated  by  heuristics.  The  transfinite  numbers,  at  least  the 
countable  ones,  gained  a further  legitimacy  through  their  necessary  involvement  in 
this  work,  contributing  to  the  mathematical  pressure  for  their  general  acceptance. 
Pressing  upward  in  the  projective  hierarchy,  by  the  early  1930’s  the  descriptive 
set  theorists  had  reached  an  impasse,  one  that  was  to  be  later  explained  by  Godel’s 
delimitative  results  with  L.  (These  matters  are  taken  up  in  §§12, 13.) 

The  other,  more  primal  Cantorian  initiative,  the  mathematical  investigation  of 
the  transfinite,  was  vigorously  advanced  into  the  higher  infinite  by  Felix  Hausdorff 
[08].  Dismissive  of  foundational  issues,  he  pursued  the  structure  of  transfinite 
ordertypes  for  its  own  sake  and  was  first  to  consider  a large  cardinal,  a weakly 
inaccessible  cardinal,  as  a natural  limit  point.  Paul  Mahlo  [11, 12, 13]  then  studied 
stronger  limit  points,  the  Mahlo  cardinals.  Closure  under  the  power  set  operation, 
intrinsic  to  the  Zermelian  set  concept,  was  later  incorporated  in  the  concept  of 
a (strongly)  inaccessible  cardinal  by  Sierpinski-Tarski  [30]  and  Zermelo  [30].  In 
the  early  semantic  investigations  before  the  general  acceptance  of  first-order  logic 
these  cardinals  provided  the  natural  models  for  set  theory,  i.e.  the  corresponding 
initial  segments  of  the  cumulative  hierarchy.  (These  topics  are  developed  in  §1.) 

Measurability,  the  most  prominent  of  all  large  cardinal  hypotheses,  embodied 
the  first  confluence  of  the  Cantorian  initiatives.  Isolated  by  Stanislaw  Ulam  [30] 
from  measure-theoretic  considerations  related  to  Lebesgue  measure,  the  concept 
also  entailed  inaccessibility  in  the  transfinite.  Moreover,  the  initial  airing  generated 
an  open  problem  that  was  to  keep  the  spark  of  large  cardinals  alight  for  the  next 
three  decades:  Can  the  least  inaccessible  cardinal  be  measurable?  (Measurability 
is  discussed  in  §2.) 

The  further  development  of  the  higher  infinite  was  to  depend  on  model- 
theoretic  techniques  brought  into  set  theory  in  the  course  of  its  larger  develop- 
ment. Godel’s  L was  the  first  example  of  an  inner  model,  a class  (definable  by  a 
formula  of  first-order  logic)  including  all  the  ordinals,  which  with  e restricted  to 
it  is  a model  of  the  axioms.  Godel  with  L had  in  fact  established  the  minimum 
possibility  for  the  set-theoretic  universe,  and  large  cardinals  were  to  provide  the 
counterweight  first  in  reaction  and  then  for  generalization.  Godel’ s realist  specula- 
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tions,  especially  about  Cantor’s  Continuum  Problem,  contained  the  seeds  of  later 
heuristic  arguments  for  large  cardinal  hypotheses: 

The  set-theoretic  universe  V viewed  as  the  cumulative  hierarchy  [Ja  Va  is 
open-ended  and  under-determined  by  the  set-theoretic  axioms,  and  invites  fur- 
ther postulations  based  on  reflection  and  generalization.  In  1946  remarks  Godel 
[90: 151]  suggested  reflection  in  terms  of  a set-theoretic  proposition  being  prov- 
able in  “the  next  higher  system  above  set  theory”,  which  proof  being  replaceable 
by  one  from  “an  axiom  of  infinity”.  This  ties  in  with  V cast  as  Cantor’s  Abso- 
lute being  mathematically  incomprehendable,  so  that  any  property  ascribable  to  it 
must  already  hold  in  some  sufficiently  large  Va,  some  properties  leading  directly 
to  large  cardinal  hypotheses.  In  a 1966  footnote  Godel  [90:260ff]  acknowledged 
“strong  axioms  of  infinity  of  an  entirely  new  kind”,  generalizations  of  properties 
of  Ko  “supported  by  strong  arguments  from  analogy”.  This  ties  in  with  Cantor’s 
unitary  view  of  the  finite  and  transfinite,  with  properties  like  inaccessibility  and 
measurability  technically  satisfied  by  Ko  being  too  accidental  were  they  not  also 
ascribable  to  higher  cardinals.  Both  reflection  and  generalization  are  latent  in  the 
eternal  return  of  successive  domains  as  envisioned  by  Zermelo  [30].  Whatever 
the  heuristics,  the  theory  of  large  cardinals  like  other  mathematical  investigations 
was  to  be  driven  by  open  problems  and  growing  structural  elucidations.  (These 
matters  are  taken  up  in  §3.  Other  heuristic  arguments  are  described  in  Maddy 
[88,88a].) 

The  generalization  of  first-order  logic  allowing  infinitary  logical  operations 
was  to  lead  to  the  solution  of  that  problem  of  whether  the  least  inaccessible  cardinal 
can  be  measurable.  Tarski  [62]  defined  the  strongly  compact  and  weakly  compact 
cardinals  by  ascribing  natural  generalizations  of  the  key  compactness  property  of 
first-order  logic  to  the  corresponding  infinitary  languages.  A strongly  compact 
cardinal  is  measurable,  and  a measurable  cardinal  is  weakly  compact.  Tarski’s 
student  William  Hanf  [64]  then  established  (4.7)  that  there  are  many  inaccessible 
cardinals  (and  Mahlo  cardinals)  below  a weakly  compact  cardinal.  In  particular, 
the  least  inaccessible  cardinal  is  not  measurable.  Hanf  s work  radically  altered 
size  intuitions  about  properties  coming  to  be  understood  in  terms  of  large  cardinals. 
(These  topics  are  developed  in  §4.) 

In  the  early  1960’s  set  theory  was  veritably  transformed  by  structural  initia- 
tives based  on  new  possibilities  for  constructing  well-founded  models  and  estab- 
lishing relative  consistency  results.  This  was  due  largely  to  the  creation  of  forcing 
by  Paul  Cohen  [63,64],  who  happened  upon  a remarkably  fertile  technique  for 
producing  extensions  of  models  of  set  theory.  In  a different  vein,  a seminal  result 
of  Dana  Scott  [61]  stimulated  the  investigation  of  elementary  embeddings  of  inner 
models.  The  ultraproduct  construction  of  model  theory  was  just  gaining  currency 
when  Scott  took  an  ultrapower  of  V itself  to  establish  (5.5)  that  if  there  is  a mea- 
surable cardinal,  then  V f L.  Large  cardinal  hypotheses  thus  assumed  a new 
significance  as  a means  for  maximizing  possibilities  away  from  Godel’s  delimi- 
tative  construction.  And  Cantor’s  Absolute  notwithstanding,  Scott’s  construction 
began  the  liberal  use  of  manipulative  inner  model  constructions  in  set  theory.  It 
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was  in  this  richer  setting  that  measurable  cardinals  came  to  play  a central  structural 
role,  being  necessary  for  securing  well-founded  ultrapowers  (see  5.6  and  before): 
There  is  an  elementary  embedding  j:  V — » M for  some  inner  model  M iff  there  is 
a measurable  cardinal.  (These  matters  are  taken  up  in  §5.) 

With  reflection  arguments  emerging  in  the  model-theoretic  approaches  taken 
in  set  theory,  Azriel  Levy  [60a]  established  their  broader  significance  and  the 
close  involvement  of  Mahlo  cardinals.  Then  Hanf-Scott  [61]  formulated  the  inde- 
scribable cardinals,  directly  positing  reflection  properties  in  terms  of  higher-order 
languages,  and  showed  that  these  cardinals  provide  a schematic  approach  to  com- 
paring large  cardinals  by  size.  Levy  [71]  then  provided  a systematic  analysis, 
features  of  which  were  to  occur  in  later  contexts.  (Indescribability  is  described 
in  §6.) 

Scott’s  result  that  if  there  is  a measurable  cardinal  then  V f L naturally  led 
to  refinements  both  weakening  the  hypothesis  and  strengthening  the  conclusion. 
Notably,  the  first  moves  were  made  in  the  context  of  the  infinitary  combinatorics 
then  being  developed  by  Paul  Erdos  and  his  collaborators,  the  study  of  partition 
properties,  which  are  transfinite  generalizations  of  a result  of  Frank  Ramsey  [30]. 
Frederick  Rowbottom  [64, 71]  then  established  a partition  property  for  measurable 
cardinals  (7.17),  and  using  model-theoretic  methods  showed  that  such  properties 
already  imply  that  there  are  only  countably  many  reals  in  L (8.3).  This  blending 
of  model  theory  and  infinitary  combinatorics  led  to  a spectrum  of  large  cardinals 
positing  strong  versions  of  the  Lowenheim-Skolem  theorem,  the  Rowbottom  and 
Jonsson  cardinals  in  particular  generating  intriguing  questions.  Weaving  in  the 
crucial  model-theoretic  concept  of  a set  of  indiscernibles  Jack  Silver  [66, 71]  then 
analyzed  what  came  to  be  regarded  as  the  essence  of  transcendence  over  L,  encap- 
sulated by  him  and  Robert  Solovay  [67]  as  a set  0#  of  integers  coding  a collection 
of  sentences  uniquely  specified  by  indiscernibility  conditions.  Beyond  a web  of 
implications  encircling  the  merely  negative  conclusion  V f L,  the  existence  of 
0#  is  a strikingly  informative  assertion  about  just  how  starkly  L is  generated  in  a 
transcendent  V.  Subsequent  results  have  buttressed  the  existence  of  0#  as  a piv- 
otal hypothesis,  and  its  isolation  is  the  first  real  triumph  for  large  cardinals  in  the 
elucidation  of  set-theoretic  structure.  (These  matters  are  taken  up  in  Chapter  2.) 

Returning  to  the  early  1960’s,  if  Godel’s  construction  of  L had  launched 
axiomatic  set  theory  as  a distinctive  field  of  mathematics,  then  Cohen’s  technique 
of  forcing  began  its  transformation  into  a modern,  sophisticated  one.  Starting  with 
his  work  on  the  Continuum  Hypothesis  many  problems  that  had  been  left  unre- 
solved were  shown  to  be  independent,  as  set  theorists  were  presented  a remarkably 
general  and  flexible  scheme  with  intuitive  underpinnings  for  constructing  models 
of  set  theory.  The  thrust  of  research  gradually  deflated  the  Cantor-Godel  real- 
ist view  with  an  onrush  of  new  models,  and  shedding  some  of  its  foundational 
burden  set  theory  became  an  intriguing  mathematical  subject  where  formalized 
versions  of  truth  and  consistency  became  matters  for  combinatorial  manipulation 
as  in  algebra.  From  Skolem  relativism  to  Cohen  relativism  the  role  of  set  theory 
for  mathematics  became  even  more  evidently  one  of  an  open-ended  framework 
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rather  than  an  elucidating  foundation.  From  this  point  of  view,  that  the  ZFC  ax- 
ioms do  not  determine  the  cardinality  2K°  of  the  set  of  reals  seems  an  entirely 
satisfactory  state  of  affairs.  With  the  richness  of  possibility  for  arbitrary  reals 
and  mappings,  no  axioms  that  do  not  directly  impose  structure  from  above  should 
constrain  a set  as  open-ended  as  the  collection  of  reals  or  its  various  possibilities 
for  well-ordering. 

Inaccessible  cardinals  figured  from  the  beginning  in  this  sea-change,  first 
in  the  concept  of  the  Levy  collapse  and  then  in  its  use  in  Solovay’s  inspiring 
result  [65b,  70]  that  if  there  is  an  inaccessible  cardinal,  then  in  a submodel  of  a 
forcing  extension  every  set  of  reals  is  Lebesgue  measurable  and  has  the  perfect  set 
property.  (The  Axiom  of  Choice  necessarily  fails  in  this  submodel.)  As  Cohen’s 
independence  of  the  Continuum  Hypothesis  did  for  the  transfinite,  this  result  on  the 
regularity  of  sets  of  reals  not  only  resolved  old  axiomatic  issues  but  reinvigorated 
the  Cantorian  initiatives  by  suggesting  new  mathematical  possibilities.  Solovay 
[69]  soon  applied  the  ideas  of  his  proof  to  show  that  measurable  cardinals  directly 
imply  the  regularity  properties  at  the  level  of  Godel’s  delimitative  results  with  L, 
revitalizing  the  classical  program  of  descriptive  set  theory.  Then  Donald  Martin 
and  Solovay  (cf.  their  [69])  applied  large  cardinal  hypotheses  at  the  level  of  0# 
to  push  forward  the  old  tree  representation  ideas,  with  the  hypotheses  cast  in  the 
new  role  of  securing  well-foundedness  in  this  context.  (These  matters  are  taken 
up  in  Chapter  3.) 

The  perfect  set  property  led  to  the  first  instance  of  a new  phenomenon  in 
set  theory:  the  derivation  of  equiconsistency  results  based  on  the  complementary 
methods  of  forcing  and  inner  models.  A large  cardinal  hypothesis  is  typically 
transformed  into  a proposition  about  sets  of  reals  by  forcing  that  “collapses”  the 
cardinal  to  Ki  or  “enlarges”  the  power  of  the  continuum  to  the  cardinal.  Con- 
versely, the  proposition  entails  the  same  large  cardinal  hypothesis  in  the  clarity  of 
an  inner  model.  Solovay’s  result  provided  the  forcing  direction  from  an  inacces- 
sible cardinal  to  the  proposition  that  every  set  of  reals  has  the  perfect  set  property. 
But  Ernst  Specker  [57]  had  in  effect  established  that  if  every  set  of  reals  has  the 
perfect  set  property  (and  Ki  is  regular),  then  Kj  is  inaccessible  in  L (11.6).  Thus, 
Solovay’s  use  of  an  inaccessible  cardinal  was  necessary,  and  its  collapse  to  Kj 
complemented  Specker’ s observation.  Years  later,  Saharon  Shelah  [84]  was  able 
to  establish  the  necessity  of  Solovay’s  inaccessible  also  for  the  proposition  that 
every  set  of  reals  is  Lebesgue  measurable. 

The  emergence  of  such  equiconsistency  results  is  a subtle  transformation  of 
earlier  hopes  of  Godel:  Propositions  can  indeed  be  resolved  if  there  are  enough 
ordinals,  how  many  being  specified  by  positing  a large  cardinal.  But  the  resolution 
is  in  terms  of  the  Hilbertian  concept  of  consistency,  the  methods  of  forcing  and 
inner  models  being  the  operative  modes  of  argument.  In  a new  synthesis  of  the  two 
Cantorian  initiatives,  hypotheses  of  length  concerning  the  extent  of  the  transfinite 
are  correlated  with  hypotheses  of  width  concerning  sets  of  reals.  There  is  a telling 
antecedent  in  the  result  of  Gerhard  Gentzen  [36, 43]  that  the  consistency  strength 
of  arithmetic  can  be  exactly  gauged  by  an  ordinal  e0,  i.e.  transfinite  induction  up 
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to  that  ordinal  in  a formal  system  of  notations.  Although  Hilbert’s  program  of 
establishing  consistency  by  finitary  means  cannot  be  realized,  Gentzen  provided 
an  exact  analysis  in  terms  of  ordinal  length.  Proof  theory  blossomed  in  the  1960’s 
with  the  analysis  of  other  theories  in  terms  of  such  lengths,  the  proof  theoretic 
ordinals. 

In  the  late  1960’s  a wide-ranging  investigation  of  measurability  was  carried 
out  with  forcing  and  inner  models.  These  developments  not  only  provided  an  illu- 
minating structural  analysis,  but  suggested  new  questions  and  provided  paradigms 
for  the  subsequent  investigation  of  stronger  hypotheses.  Solovay  [66,71]  brought 
the  concept  of  saturated  ideal  to  the  forefront,  establishing  an  equiconsistency 
result  about  real-valued  measurability.  Subsequent  work  showed  that  saturated 
ideals  are  a flexible  generalization  of  measurability  that  can  occur  low  in  the  cu- 
mulative hierarchy.  Exploiting  the  technique  of  iterated  ultrapowers  developed  by 
Haim  Gaifman  [64],  Kenneth  Kunen  [70]  established  the  main  structure  theorems 
for  inner  models  of  measurability.  Not  only  do  these  models  have  the  minimal 
structure  of  Godel’s  L,  but  they  turn  out  to  be  exactly  the  ultrapowers  of  each 
other,  and  such  coherence  amounts  to  strong  evidence  for  the  consistency  of  the 
concept  of  measurability.  Kunen  also  established  a characterization  of  the  exis- 
tence of  0#  in  terms  of  the  non-rigidity  of  L : 0#  exists  iff  there  is  an  elementally 
embedding  j:  L -*  L.  Solovay  isolated  a set  O'  that  plays  an  analogous  role 
for  inner  models  of  measurability  that  0#  does  for  L,  and  its  existence  has  a 
similar  characterization  in  terms  of  non-rigidity.  (These  topics  are  developed  in 
Chapter  4.) 

Even  as  measurability  was  being  methodically  investigated,  Solovay  and 
William  Reinhardt  were  charting  out  stronger  hypotheses.  Taking  the  concept 
of  elementary  embedding  as  basic  they  independently  formulated  the  concept  of 
supercompact  cardinal  as  a generalization  of  both  measurability  and  strong  com- 
pactness, and  Reinhardt  formulated  the  stronger  concept  of  extendible  cardinal 
with  motivating  ideas  based  directly  on  reflection.  Reinhardt  briefly  considered 
an  ultimate  reflection  property  along  these  lines,  but  in  a dramatic  turn  of  events 
Kunen  [71b]  established  that  this  prima  facie  extension  is  inconsistent:  There  is 
no  elementary’  embedding  j:  V —*■  V.  Kunen’s  argument  turned  on  what  seemed 
to  be  a combinatorial  contingency,  but  his  particular  formulation  has  stood  as  the 
upper  bound  for  large  cardinal  hypotheses.  The  initial  guiding  ideas  shaped  and 
delimited  by  a mathematical  result,  hypotheses  just  on  the  verge  of  this  incon- 
sistency were  subsequently  analyzed,  as  well  as  the  weaker  n-huge  cardinals  and 
Vopenka ’s  Principle,  to  chart  the  terrain  down  to  the  extendible  cardinals.  The 
supercompact  cardinals  in  particular  became  prominent  as  a source  of  new  com- 
binatorics and  relative  consistency  results.  Also,  when  refinements  of  elementary 
embedding  in  the  form  of  extenders  were  formulated,  weakenings  of  supercom- 
pactness in  the  form  of  strong , Woodin,  and  superstrong  cardinals  came  to  play 
crucial  roles  in  later  developments.  (These  topics  are  developed  in  Chapter  5.) 

With  this  charting  out  of  the  higher  infinite,  the  extensive  research  through 
the  1970’s  and  1980’s  considerably  strengthened  the  view  that  the  emerging  hi- 
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erarchy  of  large  cardinals  provides  the  measuring  rod  of  exhaustive  principles 
against  which  all  possible  consistency  strengths  can  be  gauged.  First,  the  various 
hypotheses  though  arising  from  diverse  motivations  and  historical  happenstance 
nonetheless  form  a linear  hierarchy,  one  neatly  delimited  by  Kunen’s  inconsistency 
result.  Typically  for  two  large  cardinal  hypotheses,  below  a cardinal  satisfying  one 
there  are  many  cardinals  satisfying  the  other,  in  a sense  prescribed  by  the  first. 
Moreover,  the  weaker  hypotheses  through  strong  forms  of  measurability  have  been 
bolstered  by  a variety  of  equiconsistency  results  involving  combinatorial  proposi- 
tions low  in  the  cumulative  hierarchy.  In  this  respect,  particularly  intriguing  is  the 
work  on  the  Singular  Cardinals  Problem,  which  showed  that  something  as  basic 
as  rendering  2K  large  for  singular  strong  limit  cardinals  k essentially  requires  large 
cardinals.  Finally,  a variety  of  strong  propositions  have  been  informatively  brack- 
eted in  consistency  strength  between  two  large  cardinal  hypotheses:  The  stronger 
hypothesis  implies  that  there  is  a forcing  extension  in  which  the  proposition  ob- 
tains; and  if  the  proposition  obtains,  there  is  an  inner  model  satisfying  the  weaker 
hypothesis.  Supercompactness  has  often  figured  as  the  upper  bound,  but  some- 
times n-hugeness  and  even  the  hypotheses  just  short  of  Kunen’s  inconsistency 
have  played  this  role.  (This  wide-ranging  exploration  is  the  subject  of  volume  II.) 

If  set  theory  serves  as  an  open-ended  framework  for  mathematics,  as  an 
autonomous  field  of  mathematics  it  has  become  a remarkably  successful  investi- 
gation of  well-foundedness,  in  large  measure  because  large  cardinals  have  been 
found  to  provide  an  elegant  and  fully  sufficient  superstructure  for  the  study  of 
consistency  strength. 

Determinacy 

One  of  the  great  successes  for  large  cardinals  has  to  do  with  perhaps  the  most 
distinctive  and  intriguing  development  in  modem  set  theory.  Although  the  deter- 
minacy of  games  has  roots  as  far  back  as  Zermelo  [13],  the  concept  for  infinite 
games  only  began  to  be  seriously  explored  in  the  1960’s  when  it  was  realized 
that  it  led  to  the  regularity  properties  for  sets  of  reals.  Jan  Mycielski  and  Hugo 
Steinhaus  in  their  [62]  proposed  the  Axiom  of  Determinacy,  at  least  for  some  inner 
model  since  it  contradicts  the  Axiom  of  Choice.  Then  in  1967  Solovay  made 
an  initial  connection  with  large  cardinals  and  David  Blackwell  [67]  with  methods 
of  descriptive  set  theory.  Investigating  further  consequences  of  determinacy,  fine 
mathematicians  like  Solovay,  Martin,  Yiannis  Moschovakis,  Kunen,  and  Alexan- 
der Kechris  soon  established  an  elaborate  web  of  connections  in  the  unabashed 
pursuit  of  structure  for  its  own  sake.  Determinacy  hypotheses  seemed  to  settle 
many  questions  and  provide  new  modes  of  argument,  leading  to  an  opaque  real- 
ization of  the  old  Cantorian  initiatives  concerning  sets  of  reals  and  the  transfinite 
with  determinacy  replacing  well-ordering  as  the  animating  principle.  By  the  late 
1970’s  a more  or  less  complete  theory  for  the  projective  sets  was  in  place,  and 
with  this  completion  of  a main  project  of  descriptive  set  theory  attention  began  to 
shift  to  questions  of  overall  consistency. 
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Martin  [70]  had  early  on  shown  that  the  existence  of  a measurable  cardinal 
implies  the  determinacy  of  games  for  analytic  sets,  and  through  the  1970’s  he 
established  results  equating  many  measurable  cardinals  with  levels  of  a difference 
hierarchy  for  analytic  sets  and  then  showed  that  a large  cardinal  hypothesis  near 
Kunen’s  inconsistency  implied  determinacy  at  the  next  projective  level.  Then  in 
the  mid-1980’s  Matthew  Foreman,  Menachem  Magidor,  and  Shelah  made  a major 
breakthrough  about  strong  large  cardinal  hypotheses,  and  although  not  directly 
involving  determinacy  Martin,  John  Steel,  and  Hugh  Woodin  were  able  to  build 
on  this  to  establish  the  consistency  of  the  Axiom  of  Determinacy  relative  to  large 
cardinals.  Woodin  in  fact  established  that  the  Axiom  of  Determinacy  is  equiconsis- 
tent  with  the  existence  of  infinitely  many  Woodin  cardinals,  pinpointing  the  axiom 
in  consistency  strength  above  measurable  cardinals  but  far  below  supercompact 
cardinals.  This  unifying  result  was  a resounding  triumph  for  the  modem  methods 
of  set  theory  and  an  unexpected  affirmation  of  the  relevance  of  large  cardinals. 
Woodin’ s subsequent  results  about  other  determinacy  hypotheses  and  infinite  com- 
binatorics speak  to  the  great  progress  that  has  been  made  and  the  promise  of  deeper 
insights  to  come.  (These  matters  are  taken  up  in  Chapter  6.) 
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0.  Preliminaries 

This  section  sets  out  the  necessary  mathematical  preliminaries  for  the  text.  Gen- 
erally speaking,  familiarity  is  assumed  with  the  development  of  set  theory  through 
the  basics  about  the  constructible  hierarchy  L and  the  method  of  forcing,  and  with 
the  basic  concepts  and  constructions  of  model  theory.  Nevertheless,  in  taking  a 
historical  approach  well-known  concepts  are  often  formulated  anew,  their  basic 
facts  reviewed,  and  references  provided  as  part  of  the  development.  In  particu- 
lar, a discussion  of  the  forcing  formalism  is  deferred  until  results  achieved  by  that 
method  are  dealt  with  squarely.  In  what  follows,  some  basic  terminology  and  con- 
cepts are  affirmed  whose  contextual  review  would  break  the  pace  of  exposition, 
and  some  standing  conventions  established. 

Set-Theoretic  Notation 

For  the  set  theory,  the  texts  Jech  [03],  Kunen  [80],  Drake  [74]  and  Levy  [79]  each 
provide  the  basic  development  of  the  subject  and  more.  The  first  three  contain  the 
necessary  preliminaries  about  L , the  first  two  such  preliminaries  about  forcing, 
and  the  first  a good  deal  of  information  about  large  cardinals.  denotes  the 
language  of  set  theory:  first-order  predicate  calculus  with  equality  and  the  binary 
predicate  symbol  e.  In  this  language  AC  denotes  the  Axiom  of  Choice,  CH  the 
Continuum  Hypothesis,  and  GCH  the  Generalized  Continuum  Hypothesis.  ZF 
denotes  Zermelo-Fraenkel  set  theory  in  £e,  ZFC  that  theory  with  AC  adjoined, 
and  ZF  and  ZFC  these  theories  with  the  Power  Set  Axiom  deleted. 

The  results  in  this  book  are  theorems  of  ZFC  , 

unless  a different  theory  is  specified  either  at  the  beginning  of  a section  or  statement 
of  a result.  Thus,  by  “class”  is  meant  definable  class,  and  although  ordered  pairs 
or  even  transfinite  sequences  of  classes  may  be  used,  they  will  be  definable  as 
single  classes.  The  set-theoretic  notation  used  in  the  text  is  generally  standard, 
with  the  possible  deviations  stipulated  in  the  following  precis: 

Unless  otherwise  specified  the  first  lower  case  Greek  letters  a,  . . . 

denote  ordinals,  whereas  the  middle  letters  k,  X,  fi,  v,  . . . are  reserved  for  infinite 
cardinals.  This  convention  is  sometimes  extended  to  allow  these  middle  letters  to 
denote  infinite  cardinals  only  in  the  sense  of  some  model,  and  this  should  be  clear 
from  the  context.  Concerning  the  ath  uncountable  cardinal,  tradition  dictates  that 
an  intensional  distinction  be  maintained  by  referring  to  its  ordertype  by  coa  and 
its  cardinality  by  although  this  distinction  is  not  always  sharp  or  illuminating. 
cf(y)  denotes  the  cofinality  of  y,  and  y+  denotes  the  least  cardinal  greater  than  y. 
A cardinal  k is  a strong  limit  iff  for  every  X < k,  2x  < k. 

On  denotes  the  class  of  ordinals,  V the  universe  of  sets,  Va  the  set  of  sets 
of  rank  less  than  a,  and  Hk  the  set  of  sets  hereditarily  of  cardinality  less  than 
k.  The  cumulative  hierarchy  Ua  Va  provides  the  basic  stratification  of  V through 
the  full  exercise  of  the  Power  Set  Axiom,  but  the  Hf  s are  often  more  suitable 
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as  approximations.  Not  only  is  their  use  more  parsimonious  since  HK  is  usually 
much  smaller  than  VK,  but  for  regular  k,  Hk  models  Replacement. 

For  a set  x,  |x|  denotes  its  cardinality,  V(x)  its  power  set,  and  VKx  — 
{y  c x | |y|  < k}.  aP  denotes  cardinal  exponentiation  for  cardinals  a and  f, 
and  aP  — |« | for  arbitrary  a and  ft  unless  it  is  contextually  clear  that  ordinal 
exponentiation  is  meant.  Also,  a<K  = 1J {a.P  \ f < k),  so  that  X<l(  is  the  cardinality 
of  VKX. 

For  a function  /,  dom(/)  denotes  its  domain,  ran (/)  its  range,  f“x  = 
{/(y)  | y e x},  the  image  of  x under  / (after  Whitehead  and  Russell),  and 
f\x  — /fl  (x  x V),  the  restriction  of  f to  x.  More  generally,  for  a binary  relation 
R,  R“x  = {b  | in  e x((a,b)  e R)}.  -vx  denotes  the  collection  of  functions: 
y -¥  x,  so  that  XK  is  the  cardinality  of  KX,  and  <ax  — <cl  Px.  For  1 < n < a>, 
an  n-tuple  is  regarded  alternately  as  a member  of  some  n-fold  Cartesian  product 
X i x ...  x X„  (defined  recursively  as  (X  \ x . . . x X„_i)  x X„  for  n > 1),  or  as 
a sequence,  i.e.  a function  with  domain  n.  For  example,  kX  x Y for  0 < k < co 
is  viewed  as  consisting  of  («o, . . . , Uk-\,  uf)  such  that  n;  e X for  i < k and 

Ufc  e Y . Concatenation  and  initial  segment  have  the  expected  meaning  for  n- 

tuples.  For  finite  sequences  s,  |s|  is  a convenient  way  of  denoting  its  length.  For 
a function  f and  an  n-tuple  {x\, ...  ,xn)  in  its  domain,  f(x\, . . . , x„)  is  written 
for  /((x i,  ...,x„». 

x c y denotes  inclusion  and  x C y proper  inclusion,  with  3 and  D having 
the  derived  meanings,  x — y denotes  set  subtraction,  x A y the  symmetric  difference 
(x  — y)U(y — x),  and  tc(x)  the  transitive  closure  of  x,  the  smallest  transitive  set  3 x. 
For  x c On,  ot(x)  denotes  its  ordertype;  min(x)  = fix  when  x ^ 0;  sup(x)  = Ux; 
max(x)  = Ux  when  Ux  e x;  [x]“  = {y  c x | ot(y)  = a};  [x]<a  = 

so  that  VKX  = [A.]<K;  [x]-“  = anc^  an  initial  segment  of  x is  a set  of 

form  x fl  a for  some  a.  Furthermore,  if  {a\,  . . . , a„)  is  used  to  denote  a member 
of  [x]<w,  it  is  with  the  understanding  that  a\  < ...  < an,  and  for  f a function 
with  domain  C [x]<<M,  /(a  i, . . . , a„ ) is  written  for  /({aq,  . . . , a„}).  In  all  of  these 
notations  proper  classes  are  allowed  when  the  resulting  expression  is  formalizable, 
i.e.  it  again  denotes  a class. 

Closed  Unbounded  and  Stationary  Sets 

For  X C On,  y is  a limit  point  of  X iff  |J(Xny)  = y > 0.  C is  closed  unbounded 
in  8 iff  C is  an  unbounded  subset  of  8 containing  all  its  limit  points  less  than  8. 
For  regular  v < 8,  C is  v-closed  unbounded  in  8 iff  C is  an  unbounded  subset 
of  8 containing  all  its  limit  points  less  than  8 of  cofinality  v.  For  limit  ordinals 
8,  S is  stationary’  in  8 iff  S c <5  and  S fl  C f 0 for  any  C closed  unbounded  in 
8.  The  “in  8 ” is  deleted  when  clear  from  the  context.  If  {Xa  \ a < 8)  e sV(S), 
then  its  diagonal  intersection  is  {£  < 8 \ % e Dff<f  denoted  by  For 

X C On  and  /:  X — » On,  / is  regressive  iff  f(a ) < a for  every  a e X — {0}. 
The  following  is  basic: 
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0.1  Proposition.  Suppose  that  X > a>  is  regular. 

(a)  If  y < X and  {Ca  \ a < y)  is  a sequence  of  sets  closed  unbounded  in  X, 
then  P|0  is  closed  unbounded  in  X. 

(b)  if  (Ca  | a < X)  is  a sequence  of  sets  closed  unbounded  in  X,  then  its 
diagonal  intersection  A a<xCa  is  closed  unbounded  in  X. 

(c)  (Fodor  [56])  If  S is  stationary  in  X and  f\S^rX  is  regressive,  then  there 
is  an  a < X such  that  /_1({a})  is  stationaiy  in  X. 

(d)  If  v < X is  regular,  Scg  < X | cf(£)  = v)  is  stationary  in  X,  and  C is 

v-closed  unbounded  in  X,  then  S fl  C 0.  H 

Filters  and  Ideals 

For  a non-empty  set  S,  F c V(S)  is  a filter  (resp.  ideal)  over  S iff  F is  a 
proper,  non-principal  filter  (resp.  ideal)  on  the  Boolean  algebra  V(S).  The  previous 
sentence  exemplifies  the  distinction  intended  between  “over”  and  “on”.  For  E c 
V(S),  the  filter  (resp.  ideal)  over  S generated  by  E is  {X  c S \ 3Y  e E(Y  C X)} 
(resp.  {X  C S | 3F  e E(X  C F)}),  provided  that  it  is  proper  and  non-principal.  A 
filter  (resp.  ideal)  F over  S is  X-complete  iff  for  any  y < X and  {Xa  \ a < y}  c F, 
a<y  Xa  e F (resp.  [Ja<  Xa  e F).  U is  an  ultrafilter  over  S iff  it  is  a maximal 
filter  over  S,  i.e.  for  any  X c.  S either  X e U or  else  .S'  — X e U . A filter  F over 
S is  uniform  iff  |X|  = |S|  for  any  X e F. 

As  a convenient  convention  it  is  assumed  that  for  a filter  F over  a cardinal  X 
the  final  segments  X — cn  = {f|Qi<£  < X]  e F for  every  a < X.  Concomitantly  it 
is  assumed  that  for  an  ideal  I over  a cardinal  X the  initial  segments  a = {£  | § < 
a}  e I for  every  a < X.  For  a regular  cardinal  X > w,  Cx  denotes  the  closed 
unbounded  filter  over  X,  the  filter  generated  by  the  closed  unbounded  subsets  of 
X,  which  is  A-complete  by  0.1(a).  Corresponding  to  it  is  NS^,  the  non-stationary 
ideal  over  X,  the  A-complete  ideal  of  non-stationary  subsets  of  X. 

Model-Theoretic  Notation 

For  the  model  theory,  familiarity  is  assumed  with  the  basic  notions  of  structures 
and  satisfaction,  elementary  equivalence,  substructure  and  embeddability,  com- 
pactness, Skolem  hull  arguments,  and  the  ultraproduct  construction  (although  these 
last  two  are  reviewed  below),  as  well  as  with  the  formalizability  of  the  satisfaction 
relation  for  set  structures.  The  standard  text  for  model  theory  is  Chang-Keisler 
[90],  and  the  texts  Devlin  [84],  Kunen  [80],  Moschovakis  [80],  and  Drake  [74] 
work  out  set-theoretic  formalizations. 

For  a formula  <p  of  Ce,  <pM  denotes  the  relativization  of  the  quantifiers  to 
the  class  M,  defined  by  recursion  on  the  complexity  of  tp.  This  then  extends  to 
definable  terms  and  classes  t to  yield  tM . More  generally,  for  M a structure  and  t 
a definable  term  in  the  corresponding  language,  tM  denotes  the  interpretation  of  t 
in  M.  The  notation  t{v\, . . . , vn)  for  a term  indicates  that  its  variables  are  among 
vi, . , v„,  and  <p( v\, . . . , v„)  for  a formula  indicates  that  its  free  variables  are 
among  v\, ...  ,vn.  For  convenience,  variables  are  usually  indexed  as  here  from 
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1,  but  on  occasion,  from  0.  In  any  case  the  particular  indexing  is  not  restrictive, 
since  just  as  (p  should  be  regarded  as  a meta-variable  for  formulas,  so  too  should 
the  v^s  be  regarded  as  meta-variables  for  formal  variables.  For  t(vi,  . . . , v„) 
and  <p(v  i, . . . , i>„)  as  above  and  for  terms  t\, ...  ,tn,  t(t\, ...  ,t„)  is  the  result  of 
replacing  each  occurrence  of  Vj  by  tj,  and  <p(h,  ...,tn)  the  result  of  replacing 
each  free  occurrence  of  v,  by  t,,  this  allowed  only  when  no  variable  in  any  f,- 
becomes  bound  by  a quantifier  of  <p.  Finally,  M |=  <p[x \, . . . , xn\  indicates  that 
<p  is  satisfied  in  the  structure  M with  the  variable  assignment  taking  i>;  to  x,  in 
the  domain  of  M,  and  tM(x i, ... , xn)  denotes  the  interpretation  of  t under  this 
assignment.  |=  is  extended  to  collections  of  formulas  as  in  Ad  [=  ZFC  with  the 
expected  meaning. 

For  a theory  T,  Con (T)  should  be  a formal  assertion  of  its  consistency  in 
terms  of  an  arithmetization  of  its  language.  For  theories  7j  and  Tj,  Tj  is  consis- 
tent relative  to  7j  iff  Con(7j)  —>  Con(T2);  and  T\  and  Tj  are  equiconsistent  iff 
Con(Ti)  **  Con(T2).  However,  such  assertions  are  better  construed  as  part  of  the 
mathematical  English,  the  proofs  in  each  case  providing  the  sense. 

Although  these  conventions  are  maintained  when  issues  of  satisfaction  and 
definability  are  discussed,  specific  formulas  of  Ce  in  our  arguments  are  rendered  as 
usual  in  a mathematical  English  with  the  assignment  of  sets  to  variables  indicated 
by  a direct  substitution  into  the  formula.  Even  when  structures  are  being  discussed 
with  some  care,  this  conflating  of  syntax  and  semantics  may  occur  to  the  right  of 
a |=;  formulas  are  then  set  off  from  the  rest  of  the  text  by  the  Quinean  quotes 
r 1 when  confusion  is  possible.  For  a formula  <p,  may  also  denote  its  code 
according  to  some  contextually  established  arithmetization. 

We  work  most  often  with  structures  for  Ce  of  form  (AT,  e fl  (AT  x AT)), 
consisting  of  a domain  together  with  the  real  membership  relation  restricted  to 
it.  For  simplicity  such  a structure  is  usually  denoted  by  (AT,  e),  or  just  AT  when 
contextually  clear.  For  a theory  T of  C€,  AT  is  an  e-model  ofT  iff  (AT,  e)  is  a 
model  of  T . We  also  work  with  expanded  structures  of  form  (AT,  e fl  (AT  x AT),  R) 
where  R c AT,  denoted  by  (AT,  e,  R).  It  is  then  to  be  understood  that  R is 
interpreting  a new  unary  predicate  symbol,  although  in  the  set-theoretic  context 
x e R is  often  written  instead  of  R(x).  Similar  remarks  apply  to  further  expanded 
structures.  When  there  is  need  to  be  explicit,  C*{R\, . . . , Rn)  denotes  the  language 
Ce  of  set  theory  expanded  by  constants  x for  every  x e X and  predicate  symbols 
R\ Rn  of  specified  arity,  with  C&{R\.  . . . , R„)  being  C%(R\ Rn ). 

We  often  consider  structures  with  domains  that  are  proper  classes,  like 
V itself.  The  previous  conventions  apply,  but  there  is  an  important  caveat: 
By  the  Godel-Tarski  undefinability  of  truth  argument  (Jech  [03:  162])  the  sat- 
isfaction relation  f<p\,(x \,...,xn))  e S iff  V 1=  <p\x\, . . . ,xn\  is  formally 
undefinable  in  ZF,  and  so  also  is  the  general  satisfaction  relation  for  proper 

classes  (l(p\  {z\, . . . , Zk),lf\  {xi ,xn))  e S+  iff  [y  | <p\y,  zu  . . . , Zk]}  1= 

ifr\xi, , x„\.  This  is  the  source  of  possible  unformalizability  in  the  text,  and  the 
issue  is  discussed  as  it  arises. 
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Hierarchies  of  Formulas 

Several  hierarchies  of  formulas  will  figure  in  analyses  of  definability,  particularly 
in  absoluteness  and  reflection  arguments.  For  any  (first-order)  extension  C of 
C€,  the  Levy  hierarchy  of  formulas  of  C is  formulated  as  follows:  A formula  is 
Xo  and  /7o  if  its  only  quantifiers  are  bounded,  i.e.  can  be  rendered  as  Vd  e w 
or  3d  e w.  Recursively,  a formula  is  Sn+\  if  it  is  of  the  form  3i>i . . . 3 vkcp 

where  (p  is  77„,  and  77„+i  if  it  is  of  the  form  Vd3 . . . Vvk(p  where  q>  is  Sn.  For 

convenience  this  terminology  is  extended  by  stipulating  that  every  formula  is  E0) 
and  nm.  The  classification  of  definable  concepts  in  this  hierarchy  depends  on  the 
ambient  theory.  For  a theory  T of  C,  a formula  (p  is  27 J iff  for  some  27„  formula 
<p',T  \-  <p  <p';  and  similarly  for  77j.  A formula  is  A¥  iff  it  is  both  27j  and 
nf . This  terminology  is  extended  to  classes  through  the  analysis  of  their  defining 
formulas. 

I7ZF  and  77, fF  formulas  are  equivalent  to  formulas  with  blocks  of  like  quanti- 
fiers contracted  into  one  through  applications  of  the  Pairing  Axiom.  Also,  bounded 
quantification  does  not  add  to  complexity  in  ZF: 

(i)  If  (p  is  I7ZF,  so  is  3d  e wcp  and  Vd  e u><p  . 

(ii)  If  <p  is  /7ZF,  so  is  3d  e w <p  and  Vd  e w<p  . 

To  show  this,  assume  inductively  that  (i)  and  (ii)  hold  with  n replaced  by  any 
k < n.  (ii)  follows  from  (i)  by  taking  negations,  and  the  first  part  of  (i)  is 
immediate.  For  the  second  part  of  (i),  <p  can  be  taken  to  be  of  form  3do  f where 
i//  is  77„_i.  Then  by  Replacement, 

Vd  e w3vof  3diVd  e id3do  e vif  , 

and  the  latter  formula  is  27;fF  by  induction. 

A typical  analysis  of  transfinite  recursion  leads  to  a useful  observation: 

0.2  Lemma.  rrank(D0)  = Di1  is  AfF,  and  rV„0  = d^  is  nf¥ . 

Proof.  Let  <p(f)  be  the  formula 

f is  a function  A Vx  e dom(/)(x  c dom(/)  A f(x)  = lj{/(y)  +1  | y e x})  . 

This  asserts  that  / must  satisfy  the  usual  recursive  definition  for  the  rank  function 
on  a transitive  domain.  Hence, 

rank(Do)  = t>i  iff  3/(<p(/)  A (d0,  vx)  e /) 

iff  V/(<p(/)  A Do  e dom(/)  -a  (d0,  di)  e /)  . 

<p(f)  is  Aq¥,  and  so  the  first  result  follows.  For  the  second,  note  that 

Vv0  — vi  iff  Vd2(d2  e di  aa  3d3  e D0(rank(D2)  = d3))  . 

The  latter  is  77jZF  by  previous  remarks  about  absorbing  bounded  quantifiers.  H 
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A well-known  observation,  established  by  induction  on  formula  complexity, 
is  that  S0  formulas  are  absolute  for  transitive  structures,  i.e.  if  <p(v\, . . . , vn)  is 
S0,  M is  transitive,  and  x\, ...  ,x„  e M,  then 

{M,  e)  [=  <p[x\, ...  ,x„\  iff  {V,&)\=(p[xi ,xn]. 

Moreover,  Z\fF  formulas  are  absolute  for  transitive  e-models  of  ZF,  i.e.  if 
<p(v i, . . . , v„)  is  Af,  M is  such  a model,  and  x\, . . . , xn  e M,  then 

{M,  e)  [=  <p[x\, ...  ,x„\  iff  {V,e)\=(p[x\ ,xn]. 

This  can  be  illustrated  in  terms  of  the  proof  of  0.2:  If  (M,  e)  |=  rank(x)  = a, 
then  by  the  S\  formulation,  in  M there  is  an  / satisfying  <p  such  that  /(x)  = a, 
and  hence  this  holds  in  V M,  so  that  rank(x)  — a in  V,  i.e.  there  is  upward 
persistence.  If  rank(x)  = a,  then  by  the  FT)  formulation,  for  any  / satisfying  cp 
with  x € dom(/)  we  have  /(x)  = a,  and  hence  this  holds  in  M C V so  that 
M |=  rank(x)  = a,  i.e.  there  is  downward  persistence.  We  often  use  the  basic 
observation  that  if  a (p  is  upward  persistent,  then  so  is  3 v<p,  and  if  <p  is  downward 
persistent,  then  so  is  Vvcp. 

For  more  information  on  the  Levy  hierarchy  see  Levy  [65],  Jech  [03: 183ff], 
Devlin  [84: 27ff],  or  Drake  [74:  75ff]. 

Concerning  the  satisfaction  relation,  the  latter  two  texts  provide  (pp.  3 Iff  and 
pp.  89ff  resp.)  formalizations  of  the  relation  for  set  structures  that  are  Z\fF,  and 
hence  absolute  for  transitive  e -models  of  ZF.  We  will  use  the  following  strategem: 
If  M is  transitive,  y e M is  transitive,  <p( vu  ■ ■ ■ , v„)  is  a formula  of  £e,  and 
x\, ...  ,xn  e y,  then 

(y,  e>  N <p[x i xn]  iff  ( M , e)  \=  <p(y)[x \, . . . , x„\  . 

Thus,  although  (M,  e)  may  not  be  a model  of  ZF,  it  can  cast  the  satisfiability  of 
<p  in  (y,  e>  in  the  absolute  terms  of  the  So  formula  (p^f 

For  proper  classes,  the  unformalizability  of  the  satisfaction  relation  was  al- 
ready mentioned.  It  will  be  useful  to  know  that  for  any  particular  n and  transitive 
class  M,  the  satisfaction  relation  \="M  for  M restricted  to  Sn  formulas  is  formal- 
izable  in  ZF:  For  n — 0,  using  the  aforementioned  absoluteness  of  So  formulas 
for  transitive  structures, 

\=°m  ^[*i, . . . , Xk\  iff  <p( i>i, . . . , Vk ) is  Sq  and  3 y e M(y  is  transitive 
Axj  — ,xk  e y a (y,  e)  (=  <p[x i , . . . , x*])  , 

and  recursively, 

N'm1  (p[xx, . . . , xk\  iff  <p(v  i,...,vk)  is  S„+1,  say  for  simplicity 

3^+1 . . . 3 Vk+r-'fCvx, ....  vk+r)  where  f is  Sn, 
and  3y  ] . . . lyr^(\=nM  f[xu  ...,xk,  >'i y,.])  . 

General  higher-order  languages  have  typed  variables  of  every  finite  type  (or 
order),  quantifications  of  these,  and  beyond  the  atomic  formulas  specified  by  the 
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language,  X e Y and  X = Y for  any  typed  variables  X and  Y.  In  the  intended 
semantics,  if  D is  the  domain  of  a structure,  type  1 variables  play  the  usual  role 
of  first-order  variables,  type  2 variables  range  over  V(D),  and  generally,  type 
i + 1 variables  range  over  V' (D)  where  V denotes  i iterations  of  the  power  set 
operation.  A formula  is  77“  iff  it  starts  with  a block  of  universal  quantifiers  of 
type  m + 1 variables,  followed  by  a block  of  existential  quantifiers  of  type  m + 1 
variables,  and  so  forth  with  at  most  n blocks  in  all,  followed  afterwards  by  a 
formula  containing  variables  of  type  at  most  m + 1 and  quantified  variables  of 
type  at  most  m.  A formula  is  X”  iff  it  starts  instead  with  existential  quantifiers. 
Of  course,  formulas  containing  only  type  1 variables  can  be  construed  as  the  usual 
first-order  formulas.  Note  that  this  classification  of  formulas  is  cumulative  because 
of  the  “at  most”:  any  TTf  or  S’"  formula  is  also  77J  and  S‘s  for  any  r > m,  or 
/■  = m and  s > n.  Over  the  structures  that  will  be  considered  these  formulas  will 
always  be  equivalent  to  like  formulas  where  blocks  of  like  quantifiers  have  been 
contracted  into  one. 

Well-F  oundedness 

For  a binary  irreflexive  relation  R with  field  a set  A, 

R is  well-founded  iff  any  non-empty  Y C.  X has  an  R -minimal  element . 

It  is  not  assumed  that  R is  transitive.  The  following  is  basic: 

0.3  Lemma.  For  a binary’  relation  R with  field  X,  R is  well-founded  iff  there  is 
a p\  X — > On  which  is  order-preserving,  i.e.  if  (x,  y)  e R,  then  p(x)  < p(y).  In 
particular,  1 R is  well-founded 1 is  AfF  and  hence  absolute  for  transitive  e-models 
of  ZF.  ' H 

In  the  substantive  direction,  define  a rank  function  p by  recursion: 

P 00  = sup({p(x)  + 1 | {x,  y)  e R})  . 

The  definition  of  well-foundedness  is  II  fF  and  the  rank  function  formulation,  XfF . 
Also  basic  is  the  Collapsing  Lemma: 

0.4  Lemma  (Mostowski  [49],  Shepherdson  [51]).  Suppose  that  (M,  £,...}  is  a 
(possibly  proper  class)  structure  with  E a binary  relation  on  M satisfying: 

(a)  E is  well-founded; 

(b)  ( M , E)  is  extensional,  i.e.  if  a,  b e M and  x E a iff  x E b for  every 
x e M,  then  a = b;  and 

(c)  E is  set-like,  i.e.  {x  \ x E a}  is  a set  for  every  a e M. 

Then  there  is  a unique  isomorphism  tc\  (M , £,...}  — ► ( M , €,...}  where  M is 
transitive.  H 

(M,  e, . . .}  is  the  transitive  collapse  of  ( M , e, ...);  it  is  defined  by  recursion 
on  E by:  tt(x)  — (7r(y)  | y E x]  for  x e M.  The  . . . accommodates  further 
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relations  and  functions.  For  example,  for  (Af,  E,  R)  where  R C " Af  there  is  an 
S c "Af  such  that  jr : ( Af , E , R)  — > (Af,  e,  S)  satisfies  x e R iff  n(x)  e 5 for 
x e "Af. 

Conditions  (b)  and  (c)  are  usually  immediate  for  applications  of  the  lemma  in 
this  text,  the  former  since  the  (Af,  E. . . .}  being  considered  is  usually  elementarily 
equivalent  to  some  (X,  e,  . . .)  where  X is  transitive,  (a)  is  more  consequential,  but 
the  lemma  will  often  be  invoked  when  (Af,  E, . . .)  is  an  elementary  substructure  of 
some  (X,  e, . . .)  when  it  too  is  immediate.  As  applications  of  the  lemma  become 
routine,  these  conditions,  and  indeed  the  lemma,  will  be  mentioned  less  and  less. 

Skolem  Hull  Arguments 

To  specify  the  approach  and  terminology,  by  a Skolem  hull  argument  is  meant  an 
argument  based  on  some  version  of  the  following  construction,  detailed  in  Chang- 
Keisler  [90:  §3.3].  Let  Ad  = (Af, . . .)  be  a structure  for  a language  C.  For  a 
formula  <p(  Vo, . . . , v„)  of  C,  /:  "Af  -»  Af  is  a Skolem  function  for  (p  iff  for  any 
x\, ...  ,x„  e Af, 

Ad  \=  3vQ(p[x\, . . . , x„]  implies  that  Ad  1=  <p\f(x i, . . . , x„),  xi, . . . , xn]  . 

Such  functions  can  be  provided  through  a well-ordering  of  Af  by  taking  least 
witnesses,  and  so  if  Ad  has  a definable  well-ordering  of  Af,  there  are  definable 
Skolem  functions  for  every  <p.  A complete  set  of  Skolem  functions  for  Ad  is  the 
closure  under  functional  composition  of  some  collection  {fv  \ q>  is  a formula} 
where  fv  is  a Skolem  function  for  <p.  Such  a collection  has  cardinality  \C\,  the 
number  of  formulas  of  C. 

Let  M*  be  the  expansion  (M,  fa)a< \c\,  where  /„:  k(a)M  — > Af  and  {fa  \a  < 
\C\)  is  a complete  set  of  Skolem  functions  for  A4.  The  function  symbols  corre- 
sponding to  the  fa’s  in  the  expanded  language  are  the  Skolem  terms.  For  any 

X c Af, 

{/<*(•* i,  • • • , **(«>)  I « < |/3|  a Xu  . . . , xjt(„)  e X} 

is  the  domain  of  a substructure  of  Ad*,  and  its  reduct  H(X)  to  the  original  language 
is  the  Skolem  hull  of  X in  M with  respect  to  {fa  \ a < \C\}.  By  the  usual 
(Tarski)  criterion,  H(X)  -<  M,  i.e.  H(X)  is  an  elementary  substructure  of  M. 
This  provides  a useful  version  of  the  Lowenheim-Skolem  Theorem: 

0.5  Theorem  (Tarski- Vaught  [57:  92]).  Suppose  that  M = (Af,  . . .)  is  a structure 
for  a language  C,  and  X C Af.  Then  there  is  an  Ado  = (Afo, . . .)  -<  Ad  with 
cardinality  at  most  |X|  + \C\  such  that  X C Mq.  H 

Beyond  this  result,  a complete  set  of  Skolem  functions  provides  a uniform 
way  of  generating  many  elementary  substructures  of  a given  structure. 
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Ultraproducts 

The  basic  ultraproduct  construction  is  reviewed  here  partly  in  order  to  establish 
the  notation  (see  Chang-Keisler  [90:  §4.1]  for  details).  Let  U be  an  ultrafilter 
over  a set  S,  and  for  each  i e S let  Mi  = ( M, , . . .}  be  a structure  for  some  fixed 
language  C.  A/,-  denotes  the  Cartesian  product  of  the  Mi’s,  i.e.  the  collection  of 
all  functions  / with  domain  S satisfying  f(i)  e Mt  for  i e S.  For  /,  g e Hs  Af,- 
define 

f=ug  iff  He  S | f(i)  = £(/)}  e U . 

Then  =v  is  an  equivalence  relation  on  ]~[.s  Af;,  so  let  (/);/  denote  the  correspond- 
ing equivalence  class  of  / and  set  UsMi/U  = {(f)u  | / e Us  Mf.  Finally, 
define  the  ultraproduct  of  the  Mi’s  by  U,  a structure  for  £ denoted  by  Hs  Mi/U, 
as  follows:  (i)  its  domain  is  ]”[.?  Mj/U,  and  (ii)  for  any  u-ary  predicate  symbol  in 
£ interpreted  in  Mi  by  the  «-ary  relation  R , C "M,-,  the  interpretation  Ry  in  the 
ultraproduct  is  defined  by: 

(ifi)u,  • ■ • , (fn)u)  e Ru  iff  {i  e S \ (Mi), /„(/)>  e R,}  e U . 

The  interpretations  of  the  function  and  constant  symbols  are  defined  analogously. 
Note  that  if  each  Rj  is  the  real  membership  relation  restricted  to  M,-,  then  the 
corresponding  interpretation  Ev  in  the  ultraproduct  is  given  by: 

(f)u  Ev  ( g)v  iff  {i  e S | f(i)  e g(i)}  e U . 

The  following  is  the  basic  Los’s  Theorem'. 

0. 6  Theorem  (Los  [55]).  For  a formula  <p(vi, . . . , vn)  and  fi, , fn  en  sMi. 

T\sMi/U^<p[(Mu....Afn)u]  iff 

[i  e 5 | Mi  |=  <p[fi(i),...,  e U . H 

The  proof  is  by  induction  on  the  complexity  of  cp;  the  Axiom  of  Choice  is  needed 
at  the  existential  quantifier  step. 

When  there  is  a fixed  M — ( M , . . .)  such  that  each  Mi  is  Ad,  the  ultraproduct 
is  the  ultrapower  of  M by  U,  denoted  SM/U.  In  this  case  there  is  by  0.6  an 
elementary  embedding  j:  M -*  SM/U  given  by:  j(x)  — ( fx)u  f°r  x e Af, 
where  fx  is  the  constant  function:  S ->  {*}.  Ultrapowers  of  V itself  are  formulated 
in  §5,  and  variations  considered  in  subsequent  sections. 

Direct  Limits 

The  direct  limit  construction  is  formulated  here  for  elementary  embeddings  of 
structures;  tailored  versions  with  obvious  modifications  are  used  several  times  in 
different  contexts.  A directed  set  is  a partially  ordered  set  ( S , <)  such  that  for  any 

1,  j e S there  is  a k e S such  that  i < k and  j < k.  A directed  system  is  a pair 
((Mi  | i e S),  (fj  | i < j ))  where  ( S , <}  is  a directed  set,  each  Mi  is  a structure 
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for  a fixed  language  C,  and  each  fj\  At ,■  -<  Mj  is  an  elementary  embedding  such 
that  fik  = fjk  ° fij  for  i < j < k (so  that  each  fa  is  the  identity  on  At,). 

A direct  limit  of  such  a system  is  a structure  M for  C for  which  there  are 
elementary  embeddings  f:  Mj  -<  M for  i e S with  f)  — fj  o fj  for  i < j,  such 
that:  for  each  x in  the  domain  of  At,  x e ran (/)■)  for  some  i e S.  The  following 
proposition  gives  the  essence  of  this  concept: 

0.7  Proposition.  Suppose  that  {{Mi  \i  e S),  {fij  \ i < j))  is  a directed  system  and 
M is  a direct  limit  with  corresponding  embeddings  f:  Mi  -<  M.  Assume  that  J\[ 
is  a structure  such  that  there  are  elementary  embeddings  g,-:  Mi  ~<  M satisfying 
g,  = gj  o fj  for  i < j.  Then  there  is  an  elementary  embedding  g:  M -<  Af  such 
that  gi  — go  ft.  H 


To  define  g,  for  x in  the  domain  of  M,  say  x = f(x)  for  some  i e S and 
x in  the  domain  of  Mi,  set  g(x)  — gj (x).  “Une  chasse  sur  les  diagrammes” 
confirms  that  the  g is  a well-defined  elementary  embedding. 

0.7  implies  that  any  two  direct  limits  of  a directed  system  are  isomorphic, 
and  so  one  can  speak  of  the  direct  limit  - once  it  is  established  that  there  is  one 
at  all,  which  can  be  done  as  follows: 

Suppose  that  {{Mj  \ i e S),  {f  j \ i < j))  is  a directed  system  with  M,-  the 
domain  of  Mj  for  i e S.  Set  B = IJ/esf'  ! x Mj,  a union  of  disjoint  copies  of  the 
Mi’s.  Define  a binary  relation  ~ on  B by: 

(f,  x)  ~ ( j , y)  iff  3k  e S(i  < k A j < k A fik{x)  = fjk(y))  ■ 

It  is  simple  to  check  that  ~ is  an  equivalence  relation,  so  letting  [(i,  x>]  be  the 
corresponding  equivalence  class  of  {i,x),  set 

M = fi/~  = {[(/',  x>]  | (i,x)  e B } . 

To  expand  M into  a structure  for  C,  suppose  for  instance  that  C has  an  n-ary 
predicate  symbol  interpreted  in  Mj  by  R,  c "Mj  for  i e S.  Then  define  a 
corresponding  R c nM  as  follows:  Given  [(/ 1 , xi>], . . . , x„)]  e M,  first  find 

a k e S such  that  is  < k for  1 < s < n.  Then  stipulate  that 

([(i'i,  xi)],  — [(/„,  x„>]>  e R iff  {flk(x  i), . . . , fnk{xn ))  e Rk  ■ 

It  is  simple  to  check  that  R is  well-defined.  Interpretations  of  function  and  con- 
stant symbols  are  defined  analogously.  Let  M — { M , R, . . .)  be  the  resulting 
structure  for  C.  For  each  i e S define  f:  Mj  -a  M by:  f (x)  = [(Lx)].  Then 
straightforward  arguments  show  that  At  is  a direct  limit  with  f’s  the  verifying 
embeddings. 

Measure  and  Category 

These  preliminaries  are  concluded  by  reviewing  some  basic  concepts  used  in  the 
study  of  the  continuum.  E denotes  the  set  of  reals,  formalized  as  the  Dedekind 
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completion  of  the  rationals.  However,  members  of  OJoj  or  V(co)  are  also  called 
“reals”  following  set-theoretic  practice.  For  discussing  the  structural  properties  of 
E in  set  theory  it  is  convenient  to  work  with  10 co  instead,  topologized  by  taking  as 
the  basic  open  sets 

0(s)  = [f  € aco  I S c /} 

for  s e <(0co.  This  is  known  as  Baire  space,  again  denoted  by  wco.  This  space 
is  homeomorphic  to  the  irrationals,  and  the  essential  features  of  the  structural 
properties  that  will  be  considered  are  preserved  in  this  association  with  E (see  e.g. 
Levy  [79:  V1I§3] ).  Once  and  for  all, 

we  fix  an  enumeration  (s,-  | i e a>)  of  <Mco  such  that  |s,-|  < i 

and  sequences  appear  after  their  proper  initial  segments,  given  in  some  effective 
manner.  Some  simple  observations  are  made  about  the  topology: 

0.8  Exercise  (ZF).  Suppose  that  s,te  <u>co.  Then: 

(a)  0(s)  fl  0(t)  is  either  0,  O(s),  or  O(t). 

(b)  O(s)  — O(t)  is  a disjoint  union  of  basic  open  sets. 

(c)  0(t)  is  clopen,  i.e.  closed  as  well  as  open. 

(d)  Every  open  set  is  a disjoint  union  of  basic  open  sets. 

Hint.  For  (b)  note  that 

0(s)  - 0(t)  = |J{0(s)  O 0(u)  | \u\  = \t\  A u ± t] 

and  apply  (a).  For  (c)  take  s = 0 in  (b).  Finally,  for  (d)  suppose  that  O is  an 
open  set,  say  O — U/eft)  Oisij)  in  terms  of  our  fixed  enumeration.  Set  Xj  = 
0(s,-j)  — U kcj  Then  O = [J  - Xj  is  a disjoint  union,  and  by  repeated 

application  of  (b)  each  Xj  is  in  turn  a disjoint  union  of  basic  open  sets.  H 

These  properties  are  one  of  several  advantages  of  working  with  “’co  and  imply 
that,  chameleon-like,  tuco  is  “zero-dimensional”:  For  0 < k < co  let  *(“&>),  the  k- 
fold  Cartesian  product  of 01  co,  be  topologized  with  the  product  topology,  i.e.  using 
basic  open  sets  of  the  form 


O (s„ ) x •••  x 0(sik)  . 

Then  unlike  for  E,  there  is  a homeomorphism:  M&)  — > k(wco).  In  fact,  any  bijection 
of  co  with  co  ■ k induces  such  a homeomorphism.  In  what  follows,  concepts 
are  described  for  “a > for  notational  simplicity,  but  their  extensions  to  k(a>oo)  are 
assumed  through  these  homeomorphisms  or  directly  in  terms  of  the  corresponding 
product  notions. 

Turning  first  to  the  concept  of  measure,  we  first  review  the  axiomatic  approach 
of  Emile  Borel:  For  any  set  S and  F C V(S), 

F is  a a -algebra  on  S iff  0 e F and  F is  closed  under  the  taking 

of  complements  and  countable  unions  . 
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For  such  F and  any  m:  F ->  [0,  1],  where  [0,  1]  c R is  the  unit  interval  of  reals, 

m is  a (probability)  measure  on  F iff 

(i)  m(0 ) = 0 and  m(S)  = 1 , and 

(ii)  if  {Xn  | n e co)  c F is  a pairwise  disjoint 
collection,  then  m( (J„  X„)  = m(Xn)  . 

With  m(o  in  place  of  R, 

the  Bore l sets  are  the  members  of  the 

a -algebra  B on  “co  generated  by  the  0(i)’s  . 

The  Borel  sets  form  a natural  hierarchy  (see  §12),  and  focal  are  members  of  the 
second  level,  the  Fa  and  G$  sets  in  the  classical  but  persistent  terminology  of 
Hausdorff  [14]: 

A is  Fa  iff  A is  a union  of  countably  many  closed  sets  . 

A is  Gs  iff  A is  an  intersection  of  countably  many  open  sets  . 

To  endow  B with  a measure,  let  m0)  be  that  measure  on  Vico)  satisfying  m^di})  = 
2-(!+1),  so  that  m^(a)  = 2-(,+1)  for  any  a c a>.  Applying  a standard 

measure-theoretic  construction  (see  e.g.  Halmos  [50: 157]),  stipulate  that 

mg  is  the  product  measure  on  B induced  by  mlo  . 

m^(0(s))  = n,<u|2-(s(l)+1)  for  each  5 e <toa>  — [0],  and  is  the  unique 
measure  on  B with  this  property.  With  wco  in  place  of  R,  mg  is  Borel’s  measure 
from  his  [98:46-47],  and  it  was  extended  by  Lebesgue’s  measure  from  his  [02], 
now  of  course  a basic  feature  of  mathematical  analysis.  A simple  way  to  formulate 
his  extension  is  to  stipulate  first  for  N C that 

N is  null  iff  there  is  a Borel  set  X such  that  mg(X)  = 0 and  N C X , 

and  then  for  A C.c°co  that 

A is  Lebesgue  measurable  iff  A A B is  null  for  some  Borel  set  B . 
in  which  case  the  Lebesgue  measure  of  A is 

mL(A)  — m&{B ) . 


Clearly, 

the  Lebesgue  measurable  sets  form  a a -algebra  ML  extending  B , 
and 

m L is  a measure  on  Ml  extending  mg  on  B . 

See  e.g.  Flalmos  [50]  for  the  further  development  of  the  theory.  Two  well- 
known  properties  are  stated  here  for  later  reference.  The  first  is  immediate  from  the 
standard  characterization  of  mj,  in  terms  of  the  inner  and  outer  measures  derived 
from  ms- 
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0.9  Lemma. 

(a)  For  any  Lebesgue  measurable  set  A and  e e K with  e > 0,  there  is  a 
closed  set  C and  an  open  set  O such  that  C C A C O and  mi(0  — C)  < e. 

(b)  For  any  Lebesgue  measurable  set  A,  there  is  an  Fa  set  X and  a G$  set  Y 

such  that  ICACF  and  nii(X)  — ihl(A)  — mi(Y).  H 

(b)  follows  directly  from  (a)  and  highlights  the  significance  of  F„  and  G$ 

sets. 

The  next  theorem  is  the  set-theoretic  formulation  of  the  familiar  Fubini  The- 
orem in  the  case  of  null  subsets  of  the  plane.  In  the  present  context,  Lebesgue 
measure  for  k(fcS)  is  just  the  A'-fold  product  measure  of  mL\  for  A C 2(a)a> ) and 
y e wa>,  temporarily  set  Ay  — {x  \ (x,  y)  e ,4}. 

0.10  Theorem.  Suppose  that  A C 2 (r"a>)  is  Lebesgue  measurable.  Then  A is  a null 
subset  of  2(‘°co)  iff  {y  | Ay  is  not  a null  subset  of  01  co}  is  a null  subset  of  wu>.  H 

The  Baire  property  evolved  from  the  topological  classifications  of  Rene  Baire. 
For  A C a,co, 

int( A)  = Uf  O | O is  open  A OCA},  and 
cl(A)  = fj{C  | C is  closed  A C c A}  . 

int(A)  is  the  interior  of  A,  the  largest  open  set  contained  in  A,  and  cl(A)  is  the 
closure  of  A,  the  smallest  closed  set  containing  A. 

A is  nowhere  dense  iff  int(cl(A))  = 0 . 

A is  meager  {ox  first  category)  iff  A is  a union  of  countably 

many  nowhere  dense  sets  . 

A has  the  Baire  property  iff  A A O is  meager  for  some  open  set  O . 

It  is  simple  to  see  that  if  O is  open,  then  the  closed  set  cl(O)  — O is  nowhere 
dense.  Also,  open  and  meager  sets  have  the  Baire  property  (the  latter  as  0 is 
open).  The  Baire  property  arose  from  considerations  related  to  the  Baire  Category 
Theorem : 

0.11  Theorem  (Baire  [99]).  No  nonempty  open  set  is  meager. 

Proof.  Assume  to  the  contrary  that  O is  a nonempty  open  set  such  that  O — 
U j€lo  Xj  where  each  Xj  is  nowhere  dense.  Define  a sequence  (;)■  | j e a>)  e wa> 
such  that  j < k implies  that  O c (9(s,;)  c 0(sit)  and  |Sj.|  < |s;t|  as  follows: 

Let  io  be  least  such  that  D(s,0)  c O.  Given  ij,  there  must  be  a t c s,-;  such 
that  0{t)  fl  Xj  — 0,  else  cl  (Xj)  c OiSifi  and  Xj  would  not  be  nowhere  dense. 
Let  ij+ 1 be  the  least  i such  that:  0{ s,)  fl  Xj  =0,  s,-  c si( , and  |s,j  > |s(/ 1. 

Clearly,  if  x = [J.  s,.,  then  x € O — [J  - Xj,  which  is  a contradiction.  H 

This  theorem  can  be  seen  as  an  extension  of  Cantor’s  result  that  the  reals  are 
uncountable,  and  the  proof  a generalization  of  his  original  1873  proof.  The  Baire 
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Property  draws  its  life  from  the  theorem:  if  it  failed,  then  by  a homeomorphism 
argument  mco  would  be  meager,  and  hence  any  A c ® co  would  have  the  Baire 
Property. 

The  next  lemma  gives  some  simple  consequences  of  the  definitions: 

0.12  Lemma.  Suppose  that  A C '”o>  has  the  Baire  property.  Then: 

(a)  wco  — A has  the  Baire  property. 

(b)  There  is  a G$  set  X and  an  F„  set  Y such  that  X C A C Y and  Y — X 
is  meager. 

Proof.  Let  O be  open  such  that  A A O is  meager.  For  (a),  note  that 
Cco  - A)  A Cco  - O)  = A A O , 

that  cl(O)  — O is  nowhere  dense,  and  hence  that  (“w  — A)  A (“&>  — cl(<9))  is 
meager. 

For  (b),  first  note  that  since  the  closure  of  a nowhere  dense  set  is  also  nowhere 
dense,  there  is  a meager  F„  set  F such  that  A A O c F . Then  X — O — F is 
Gs,  X c A,  and  A — X is  meager.  By  (a)  there  is  similarly  a G$  set  G such  that 
G c (“&)  — A)  and  (“w  — A)  — G is  meager.  Taking  Y — coa>  — G,  the  proof  is 
complete.  H 

Note  that  (b)  is  analogous  to  0.9(b)  but  with  the  roles  of  the  F„  and  Gs  sets 
reversed.  By  (a)  and  simple  consequences  of  the  definitions, 

the  sets  having  the  Baire  property  form  a a -algebra  extending  B . 

Further  important  analogies  as  well  as  crucial  differences  exist  between 
Lebesgue  measurability  and  the  Baire  property  (see  Oxtoby  [71]  and  Kunen  [84]  ). 
The  following  theorem  is  the  analogue  of  0.10;  like  Lebesgue  measure,  the  topo- 
logical concepts  extend  naturally  to  A' ('"«). 

0.13  Theorem  (Kuratowski-Ulam  [32]).  Suppose  that  A C 2C"o>)  has  the  Baire 
property.  Then  A is  a meager  subset  of  2 (fad)  iff  [y  | Ay  is  not  a meager  subset 
of  Mcoj  is  a meager  subset  of  Mco.  H 


Chapter  1 

Beginnings 


The  beginning  threads  of  the  subject  are  picked  up  in  its  early  history.  §1  discusses 
weak  inaccessibility  and  Mahloness,  concepts  that  arose  in  the  study  of  cardinal 
limit  processes,  and  their  strong  versions,  which  led  to  early  speculations  about 
completeness  and  consistency.  §2  describes  Ulam’s  formulation  of  measurability, 
the  most  prominent  of  all  large  cardinal  hypotheses,  out  of  a measure  problem  for 
sets  of  reals.  In  §3  Godel’s  work  on  L,  the  beginning  of  axiomatic  set  theory  as  a 
distinctive  field  of  mathematics,  is  reviewed  since  in  both  reaction  and  generaliza- 
tion it  shaped  much  of  the  subsequent  work  in  large  cardinals.  §4  discusses  weak 
and  strong  compactness,  concepts  that  emerged  from  Tarski’s  study  of  infinitary 
languages,  and  establishes  Hanf  s result,  that  in  a strong  sense  there  are  many 
inaccessibles  below  a measurable  cardinal.  The  focus  of  §5  is  on  elementary  em- 
beddings and  the  ultrapower  construction:  Scott’s  pivotal  result  that  if  there  is  a 
measurable  cardinal,  then  V ^ L;  the  characterization  of  measurability  in  terms 
of  ultrapowers  and  elementary  embeddings;  and  the  related  notion  of  normality. 
And  finally  §6  discusses  indescribability,  a natural  formalization  of  reflection  phe- 
nomena in  terms  of  higher-order  languages  that  provided  a schematic  approach  to 
comparing  large  cardinals  by  size. 
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1.  Inaccessibility 

That  volume  of  Mathematische  Annalen  containing  Zermelo’s  first  axiomatization 
[08a]  of  set  theory  also  contained  Hausdorff  s wide-ranging  paper  [08]  on  trans- 
finite  ordertypes.  While  Cantor  had  concentrated  his  efforts  on  the  rational  and 
real  ordertypes,  the  second-number  class,  and  of  course,  the  Continuum  Hypoth- 
esis, Hausdorff  extended  mathematical  investigations  into  the  higher  transfinite. 
Deploring  all  the  fuss  made  over  foundations  by  his  contemporaries  (p.  436)  he 
ventured  forth  with  vigor,  pursuing  structure  for  its  own  sake.  His  paper  contains 
the  first  statement  of  the  Generalized  Continuum  Hypothesis,  the  construction  of 
the  rja  sets  - prototypes  for  saturated  model  theory  - and  for  the  first  time,  the 
following  concept  (p.443)  formulated  for  a:  > a>: 

k is  weakly  inaccessible  iff  a:  is  a regular  limit  cardinal  . 

(The  term  “inaccessible”  is  attributed  to  Kuratowski  in  Sierpinski-Tarski  [30]; 
“weakly”  is  appended  for  a later  distinction.) 

Hausdorff  observed  that  such  a k must  satisfy  coK  = k and  is  a natural  closure 
point  for  cardinal  limit  processes.  In  modem  terms,  a simple  argument  shows  that 
if  C is  closed  unbounded  in  k,  then  so  is  C'  = [a  e C \ \C  IT  a|  — a};  taking 
C = a:  we  have  C1  — {a  < k \ \a\  = a),  C = {a  < k \ coa  — a],  and  so 
forth.  However,  Hausdorff  was  to  write  in  his  classic  text  [14: 131]  that  if  weakly 
inaccessible  cardinals  did  exist,  “the  least  among  them  has  such  an  exorbitant 
magnitude  [exorbitanten  Grosse]  that  it  will  hardly  ever  come  into  consideration 
for  the  usual  [iiblich]  purposes  of  set  theory”.  It  is  now  well-known  that  the 
existence  of  weakly  inaccessible  cardinals  cannot  be  established  in  ZFC  (since 
Lk  f=  ZFC  for  such  a:  - see  §3). 

Inspired  by  Hausdorff  s work  Mahlo  [11, 12, 13]  ventured  much  further  and 
investigated  hierarchies  of  regular  cardinals  formulated  in  terms  of  higher  fixed 
point  phenomena.  Given  its  early  appearance  this  work  is  remarkable  for  its 
boldness  and  sophistication.  (Mahlo  was  a student  of  Felix  Bernstein  at  Halle 
and  attended  seminars  of  Cantor  there,  completing  his  dissertation  (on  geometry) 
in  1908.  He  was  first  to  publish  [13a]  a construction  under  CH  of  what  is  now 
known  as  a Luzin  set,  an  uncountable  set  of  reals  that  has  countable  intersection 
with  every  meager  set.  Gottwald-Kreiser  [84]  discusses  Mahlo’s  life  and  work.) 
Recasting  Mahlo’s  na  numbers, 

at  is  0 -weakly  inaccessible  iff  k is  regular  ; 

a:  is  (a  + \)-weaklv  inaccessible  iff  a:  is  a regular  limit  of  a-weakly 

inaccessible  cardinals ; and 
k is  S-weakly  inaccessible  iff  k is  a-weakly  inaccessible 

for  every  a < S 

for  limit  ordinals  8 > 0.  This  hierarchy  can  be  extended  through  diagonalization, 
considering  next  the  regular  limits  of  those  | that  are  f -weakly  inaccessible.  By 
a simple  induction  argument,  if  k is  ft -weakly  inaccessible  and  a < f,  then  k is 
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a -weakly  inaccessible.  With  Reg  the  class  of  regular  cardinals  and  A the  operation 
defined  on  X c On  by 

A(X)  = [a  e X | |XHa|  = a]  , 

the  a-weakly  inaccessible  cardinals  are  just  the  members  /l"(Reg),  where  the 
superscript  indicates  the  number  of  iterative  applications  of  A with  intersections 
taken  at  limit  stages.  The  hierarchy  of  a-weakly  inaccessible  cardinals  can  thus  be 
seen  as  the  consequence  of  imposing  regularity  on  the  process  of  taking  cardinal 
limits. 

The  A operation  leads  to  larger  and  larger  cardinals,  but  typically,  a new 
principle  is  needed  to  achieve  a qualitative  transcendence.  Mahlo  was  able  to 
formulate  such  a principle,  using  for  the  first  time  the  concept  of  a stationary  set. 
(The  term  itself  comes  from  Bloch  [53].)  His  po  numbers  are  now  known  through 
the  following  definition  for  a:  > a>: 

k is  weakly  Mahlo  iff  {p  < k \ p is  regular}  is  stationary  in  k . 

Such  a k is  regular  (if  there  were  an  unbounded  X c k such  that  | Aj  < k,  then 
the  limit  points  of  X — \ X\  other  than  k would  form  a closed  unbounded  subset 
not  containing  any  regular  cardinals),  and  so  weakly  Mahlo  cardinals  are  weakly 
inaccessible.  The  following  illustrates  what  led  Mahlo  to  these  cardinals: 

1.1  Proposition  (Mahlo  [11]).  If  k is  weakly  Mahlo,  then  k is  k -weakly  inacces- 
sible. 

Proof.  Setting  R — {p  < k \ p is  regular],  define  sets  Ca  closed  unbounded  in  k 
for  a < k by  recursion  as  follows:  Set  Co  — k.  Given  Ca,  let  Ca+ 1 consist  of  the 
limit  points  of  Ca  fl  R other  than  k.  As  k is  weakly  Mahlo,  Ca  fl  R is  stationary, 
and  so  Ca+i  is  closed  unbounded.  Finally,  for  limit  8 > 0 set  Q = n«<«  Ca- 
li is  readily  seen  by  induction  that  for  a < k,  Ca  fl  R consists  of  the  a-weakly 
inaccessible  cardinals  below  k,  and  hence  the  proof  is  complete.  H 

This  result  is  not  optimal  since  we  could  take  the  diagonal  intersection  C — 
Aa<  KCa,  infer  that  CflR  = {f</r|^is  ^-weakly  inaccessible]  is  stationary 
in  k,  and  continue.  Interestingly  enough,  Gaiffnan  [67]  showed  that  in  a concrete 
sense  a weakly  Mahlo  cardinal  is  the  least  upper  bound  of  diagonalizing  limit 
processes  from  below.  Such  an  upward  approach  is  not  possible  for  the  larger 
cardinals  that  will  be  encountered. 

Mahlo  formulated  his  pa  numbers  by  iterating  the  new  process: 

k is  0 -weakly  Mahlo  iff  k is  regular  ; 
k is  (a  + \)-weakly  Mahlo  iff  {%<  k | £ is  a-weakly  Mahlo] 

is  stationary  in  k ; and 

k is  8-weakly  Mahlo  iff  k is  a-weakly  Mahlo  for  every  a < 8 

for  limit  ordinals  8 > 0.  Again,  the  hierarchy  can  be  extended  through  diago- 
nalization,  considering  next  those  k such  that  [f  < k | £ is  ((-weakly  Mahlo] 
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is  stationary  in  at.  By  a simple  induction  argument,  if  k is  f -weakly  Mahlo  and 
a < f,  then  k is  a-weakly  Mahlo.  The  following  operation  defined  for  X c On 
is  now  known  as  Mahlo ’s  operation: 

M(X)  — [a  e X | X fl  a is  stationary  in  a]  . 

As  for  A and  the  a-weakly  inaccessible  cardinals,  the  a-weakly  Mahlo  cardinals 
are  just  the  members  of  Ma( Reg);  1.1  illustrates  the  transcendence  of  M over  A. 
Despite  its  modest  debut  Mahlo’s  operation  is  now  a standard  part  of  the  modem 
theory. 

Almost  two  decades  were  to  pass  before  the  initial  preoccupation  with  the 
extent  of  the  ordinals  was  enhanced  by  considerations  involving  the  width  of  the 
set-theoretic  universe  as  advanced  by  the  power  set  operation.  Sierpinski-Tarski 
[30]  and  Zermelo  [30]  formulated  the  following  concept  for  k > a>: 

k is  (strongly)  inaccessible  iff  k is  regular  and  a strong  limit: 

if  X < k,  then  2 x < k . 

The  adverb  “strongly”  is  suppressed  for  this  preferred  notion.  Assuming  GCH,  k 
is  inaccessible  iff  k is  weakly  inaccessible. 

The  cumulative  hierarchy  view  V — Va  of  the  set-theoretic  universe  was 
emerging  at  this  time.  In  its  terms,  the  following  proposition  implies  that  the 
existence  of  inaccessible  cardinals  cannot  be  established  in  set  theory: 

1.2  Proposition.  Suppose  that  k is  inaccessible.  Then: 

(a)  If  x c VK,  then  x e VK  iff  \x\  < k. 

(b) (VK,e)\=  ZFC. 

Proof,  (a)  In  the  forward  direction,  it  suffices  to  show  that  \Va\  < k for  a < k. 
But  this  follows  readily  by  induction  on  a.  Conversely,  suppose  that  x c VK  with 
|x|  < k.  By  the  regularity  of  k,  [rank(y)  | y e x]  C a for  some  a < k,  and  so 
V €=  1^*+ 1 ^ Ijc . 

(b)  All  the  axioms  of  ZFC  except  Replacement  are  readily  seen  to  hold  in  Va 
for  any  limit  ordinal  a > o>.  To  verify  Replacement  for  VK,  suppose  that  x e VK 
and  F is  any  function:  x — > VK.  Then  \F“x\  < |x|  < k and  so  by  (a)  F“x  e VK. 

H 


Temporarily  let  IC  be  the  hypothesis:  There  is  an  inaccessible  cardinal.  If  IC 
and  k is  the  least  inaccessible  cardinal,  then  it  is  readily  seen  that  VK  |=  ZFC  + 
->IC.  Hence,  IC  is  not  provable  in  ZFC.  This  first  independence  result  over  ZFC 
was  essentially  observed  in  Zermelo  [30]  and  was  also  asserted  in  Sierpinski- 
Tarski  [30],  (Kuratowski  [24]  reported  that  he  had  considered  a theory  of  sets 
- essentially  ZFC  sans  Foundation  - and  remarked  that  even  in  the  presence  of 
Replacement  “one  cannot  establish  [on  ne  saurait  etablir]”  the  existence  of  weakly 
inaccessible  cardinals.  However,  despite  the  reading  of  Mostowski  [49:  162]  this 
could  not  have  been  shown  at  the  time  without  assuming  some  form  of  GCH.) 
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That  the  existential  postulation  IC  immediately  leads  to  its  own  independence  is 
analogous  to  the  situation  with  the  Axiom  of  Infinity:  Va)  \=  r Every  set  is  finite1 . 
Similarly,  if  k is  the  least  Mahlo,  then  VK  |=  r There  are  no  Mahlo  cardinals1 . This 
is  a typical  feature  of  large  cardinals. 

The  independence  of  IC  can  also  be  established  by  appealing  to  the  Second 
Incompleteness  Theorem  of  Godel  [31]  since  ZFC  + IC  b Con(ZFC)  by  1.2(b). 
This  further  implies  that  Con(ZFC  + 1C)  is  not  provable  from  Con(ZFC),  again 
by  Godel’s  theorem.  This  finer  result  points  out  a qualitative  difference:  the 
very  possibility  of  discussing  formal  consistency  in  Godel’s  careful  analysis  far 
transcends  the  simple  proof  of  1.2(b).  Tarski  soon  became  aware  of  this  distinction 
([38:87]). 

With  ZFC  having  been  cast  as  a first-order  theory  1.2(b)  does  not  characterize 
inaccessibility,  by  the  Lowenheim-Skolem  Theorem  (0.5).  Significantly,  this  fact 
was  observed  only  latterly  (Mostowski  [49],  Montague- Vaught  [59])  in  the  wake 
of  new  model-theoretic  initiatives.  On  the  other  hand,  a characterization  can  be 
achieved  by  venturing  into  higher-order  logic.  Let  ZFC2  denote  the  second-order 
version  of  ZFC  where  the  schema  of  Replacement  is  replaced  by  a single  axiom 
with  a second-order  universal  quantifier.  The  following  is  established  with  the 
intended  interpretation  of  second-order  variables  as  ranging  over  arbitrary  subsets 
of  the  domain. 

1.3  Theorem  (Zermelo  [30],  Shepherdson  [52]).  k is  inaccessible  iff  VK  (=  ZFC2. 

Proof.  The  forward  direction  is  as  for  1.2(b). 

For  the  converse,  first  note  that  k is  regular:  If  not,  there  would  be  an  a < k 
and  a function  G:  a — »■  k with  a range  unbounded  in  k . Then  G c VK,  and 
so  by  second-order  Replacement  G“a  e VK.  Hence,  sup(G“a)  = k e VK,  a 
contradiction. 

It  remains  to  establish  that  at  is  a strong  limit  cardinal:  If  not,  there  would 
be  a A.  < k such  that  2X  > k.  Then  V(X)  e VK  by  the  Power  Set  Axiom,  yet  there 
would  be  a surjection  H : V (A.)  ->  k . So  again  by  second-order  Replacement, 
H“V(X)  = k e VK,  a contradiction.  H 

Strictly  speaking,  the  intrusion  of  the  second-order  satisfaction  relation  is 
not  essential  here  since  ZFC2  can  be  cast  as  a single  axiom:  VK  \=  ZFC2  is 
equivalent  to  the  relativization  (pv‘+1  for  some  first-order  formula  (p.  John  Shep- 
herdson [52]  was  first  to  give  a formal  proof  of  1.3.  However,  it  is  a remarkable 
historical  happenstance  that  Zermelo  [30]  had  already  established  an  informal  ver- 
sion: 

In  [30]  Zermelo  presented  his  final  axiomatization  of  set  theory,  incorpo- 
rating Replacement  and  Foundation,  and  moreover  offered  a striking,  synthetic 
vision  of  sets  as  partaking  in  a succession  of  natural  models.  From  his  [29]  clar- 
ification of  the  crucial  notion  of  definit  property  used  in  his  Separation  Axiom 
it  is  evident  that  Zermelo  couched  his  approach  in  higher-order  logic.  Strictly 
speaking,  this  would  be  the  formal  interpretation,  and  Shepherdson’ s criticism 
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([52: 227])  of  Zermelo’s  proof  of  1.3  as  “insufficiently  rigorous”  is  justified  from 
this  point  of  view.  But  Zermelo  had  adopted  a definite  anti-formalist  viewpoint 
by  then,  and  downgraded  the  significance  of  Skolem’s  remarks  [23]  on  the  rel- 
ativism of  first-order  formalizations.  For  Zermelo  concepts  like  power  set  and 
cardinal  number  simply  had  definite  extensions  and  assumed  an  absolute  signifi- 
cance. 

It  was  in  Zermelo  [30]  that  initial  segments  of  the  cumulative  hierarchy 
Lb  V*  were  first  proposed  as  models  for  the  axioms  of  set  theory.  Although  the 
hierarchy  was  adumbrated  by  Mirimanov  [17]  and  by  von  Neumann  [25],  Zermelo 
was  the  first  to  actually  adopt  the  Foundation  Axiom.  Moreover,  he  started  with 
collections  of  urelements  (his  term)  as  bases  for  his  hierarchical  models.  Although 
this  device  has  turned  out  to  be  unnecessary  for  the  enrichment  of  set  theory  itself, 
it  is  still  important  in  restricted  contexts  (like  admissible  sets  - see  Barwise  [75]). 

Zermelo’s  main  achievement  was  to  establish  a second-order  categoricity  of 
sorts  for  his  axioms:  he  showed  that  his  models  are  characterized  up  to  isomor- 
phism by  two  cardinals,  the  number  of  its  urelements  and  the  height  of  its  ordinals. 
Moreover,  there  is  a unique  end-extension  relation  between  any  two  models,  based 
on  these  two  invariants,  so  that  one  model  is  just  a set  in  a higher  domain.  In 
order  to  establish  these  results  Zermelo  used  the  cumulative  hierarchy  analysis 
to  correlate  models  rank  by  rank.  Grappling  with  Replacement  he  characterized 
these  ordinal  heights  of  models  (“Grenzzahlen”)  as  regular  fixed  points  of  the  Beth 
function,  and  hence  arrived  at  inaccessible  cardinals  essentially  through  1.3. 

Zermelo  went  on  to  propound  a dynamic  view  of  sets  that  posits  an  endless 
succession  of  models  of  set  theory  ([30:47]): 

The  ‘ultrafinite  antinomies  of  set  theory’  that  scientific  reactionaries  and  anti- 
mathematicians refer  to  so  assiduously  and  lovingly  in  their  campaign  against  set  theory, 
these  seeming  ‘contradictions’,  are  only  due  to  a confusion  of  set  theory  itself,  which  is 
non-categorically  determined  by  its  axioms,  with  particular  representing  models:  What 
appears  in  one  model  as  an  ‘ultrafinite  non-or  metaset’  is  in  the  next  higher  one  already  a 
fully  valid  ‘set’  with  cardinal  number  and  ordinal  type,  and  is  itself  the  foundation  stone 
for  the  construction  of  the  new  domain.  The  unlimited  series  of  Cantor’s  ordinal  numbers 
is  matched  by  just  as  infinite  a double  series  of  essentially  different  set-theoretic  models, 
the  whole  classical  theory  being  manifested  in  each  of  them.  The  two  diametrically  oppo- 
site tendencies  of  the  thinking  spirit,  the  idea  of  creative  progress  and  of  comprehensive 
completion,  which  also  lie  at  the  root  of  the  Kantian  ‘antinomies’,  find  their  symbolic 
representation  and  symbolic  reconciliation  in  the  transfinite  series  of  numbers  based  on 
the  concept  of  well-ordering.  This  series  in  its  boundless  progression  does  not  have  a true 
conclusion,  only  relative  stopping  points,  namely  those  ‘limit  numbers’  [i.e.  inaccessible 
cardinals]  which  separate  the  higher  from  the  lower  model  types.  And  thus  also,  the  set- 
theoretic  ‘antinomies’  lead,  if  properly  understood,  not  to  a restriction  or  mutilation  but 
rather  to  a presently  unsurveyable  unfolding  and  enrichment,  of  mathematical  science. 

These  words  run  counter  to  Cantor’s  (and  later,  Godel’s)  view  of  a fixed  set- 
theoretic  universe,  yet  conform  in  a way  to  his  realist  conception  of  mathematics 
as  consisting  of  all  consistently  conceivable  objects.  In  any  event,  it  is  notewor- 
thy that  inaccessible  cardinals  played  such  a prominent  role  in  an  early  structural 
analysis  of  the  universe  of  sets.  Tarski  [38,39a]  later  formalized  the  existence 
of  arbitrarily  many  inaccessible  cardinals  as  an  axiom,  phrased  in  such  a way 
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that  most  of  the  axioms  of  set  theory  including  the  Axiom  of  Choice  are  deriv- 
able. Nevertheless,  Zermelo’s  infinitistic  conception  was  soon  to  be  overtaken 
by  a growing  preoccupation  with  the  formalism  of  first-order  logic,  following  the 
incisive  analyses  of  Skolem  and  Godel. 

In  analogy  with  the  weakly  Mahlo  cardinals, 

k is  (strongly)  Mahlo  iff  {a  < k | or  is  inaccessible}  is  stationary  in  k . 

As  with  inaccessibility  the  adverb  “strongly”  is  suppressed  for  this  preferred  no- 
tion. The  hierarchies  of  the 

a-inaccessible  and  a-Mahlo  cardinals 

are  analogously  defined,  and  the  analogue  of  1.1  obtains. 

Almost  half  a century  after  their  introduction  Levy  revitalized  the  study  of 
Mahlo  cardinals  with  his  investigation  of  reflection  phenomena,  shifting  the  fo- 
cus from  the  “weak”  to  the  “strong”  versions  because  of  the  interplay  with  the 
cumulative  hierarchy  (§6). 
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2.  Measurability 

This  section  picks  up  another  thread,  one  that  led  to  measurability,  the  most 
prominent  of  all  large  cardinal  hypotheses.  This  development  occurred  in  Poland, 
which  featured  a school  of  mathematics  crucial  to  the  foundations  of  mathemat- 
ical logic,  topology,  and  analysis.  After  the  reunification  of  the  country  in  1918 
Zygmunt  Janiszewski  at  the  newly  reopened  University  of  Warsaw  encouraged 
the  focusing  of  Polish  mathematics  on  set  theory  and  related  areas  to  establish  a 
national  tradition,  and  the  publication  of  a new  journal  to  promote  international 
research  in  these  directions  (see  Kuratowski  [80]  and  Kuzawa  [68]).  This  was 
the  origin  of  Fundamenta  Mathematicae  (genitive  singular),  the  first  specialized 
journal  devoted  to  foundational  issues  and  the  main  conduit  of  scholarship  in  this 
general  area  during  the  1920’s  and  1930’s  (see  Kuzawa  [70]).  The  first  volume, 
incidentally,  contained  the  well-known  problem  of  Suslin  [20]  that  led  to  the  in- 
vestigation of  Suslin  trees  in  modem  set  theory.  At  Warsaw  Sierpinski  together 
with  Kuratowski  and  Tarski  were  soon  making  fundamental  contributions  to  set 
theory  and  the  understanding  of  its  role  in  mathematics.  For  present  concerns 
work  of  Stefan  Banach  and  the  young  Stanislaw  Ulam  at  Lwow  (now  Lviv)  on 
an  abstract  measure  problem  turned  out  to  be  fundamental. 

Modem  measure  theory  dates  back  to  Lebesgue’s  thesis  [02],  where  he  posed 
the  Measure  Problem.  For  the  real  line  this  asks:  Is  there  a function  m that 
associates  to  every  bounded  set  of  reals  X a non-negative  real  number  m (X)  such 
that: 

(a)  m is  not  identically  zero  . 

(b)  m is  translation-invariant,  i.e.  m(X)  = m(Y)  whenever 
there  is  a real  r such  that  Y = [x  + r x e X]  , and 

(c)  m is  countably  additive,  i.e.  if  {Xn  \ n e co]  is  a pairwise  disjoint 
collection  whose  union  is  a bounded  set  of  reals,  then 

m(U„  Xn)  = J2„  ™(Xn)  ■ 

Lebesgue  developed  his  measure  towards  a solution  to  this  problem,  and  of  course, 
it  is  now  integral  to  mathematical  analysis.  (For  the  historical  development  of 
Lebesgue  measure  and  integration,  see  Flawkins  [75].)  Part  of  the  problem  was 
to  decide  what  passed  for  a set  of  reals.  Giuseppe  Vitali  [05]  constructed  a 
non-Lebesgue  measurable  set  of  reals  from  a well-ordering  of  the  reals,  thereby 
showing  that  the  Measure  Problem  has  no  solution  under  the  Axiom  of  Choice. 
This  was  the  first  explicit  use  of  AC  to  construct  a specific  set  of  reals  after 
Zermelo’s  formulation  of  the  axiom,  a manipulative  use  of  a well-ordering  very 
different  from  Cantor’s  associations  of  sets  with  well-orderings.  For  Lebesgue 
[07],  Vitali’s  construction  raised  doubts  not  so  much  about  the  possibilities  for  a 
measure  but  about  AC. 

Vitali’s  counterexample  used  all  the  conditions  (a)-(c).  Banach  proposed 
a generalization  of  the  Measure  Problem  where  (b)  is  replaced  by  a minimal 
necessary  condition  to  avoid  trivial  solutions:  m({x})  = 0 for  every  x.  With 


2.  Measurability 


23 


a proof  that  has  a contemporary  relevance  Banach-Kuratowski  [29]  established 
that  under  CH  this  version  of  the  problem  also  has  no  solution.  Note  that  by 
(b)  and  (c)  a solution  m to  the  Measure  Problem  is  determined  by  its  values  on 
"P([0,  1]),  the  subsets  of  the  unit  interval  [0,  1],  Banach  realized  that  his  condition 
removed  geometric  considerations  from  the  problem,  so  that  [0,  1]  can  be  replaced 
by  an  arbitrary  set  S.  In  this  case,  if  m is  not  identically  0 on  V{S),  then  surely 
m(S ) > 0,  and  so  we  can  normalize  and  assume  that  m(S ) = 1.  Thus,  the  measure 
problem  of  Banach  [30]  might  as  well  be  posed  as  follows:  Is  there  a nonempty 
set  S and  a function  m:  V{S)  — > [0,  1]  such  that: 

(i)  m(S)  = 1 , 

(ii)  m({x\)  — 0 for  every  x e S , and 

(iii)  for  pairwise  disjoint  {X„  \ n e w}  c V(S), 

'”(U„  Xn)  = T,nm(Xn)  ■ 

Such  a function  will  simply  be  called  a measure  over  S (it  is  a measure  on  V(S) 
in  the  sense  of  §0).  The  following  exercise  points  to  a salient  feature  of  measures: 

2.1  Exercise.  IfT  C {X  C S \ m(X)  > 0}  is  uncountable,  then  there  are  distinct 
Y,  Z € T such  that  m(Y  fl  Z)  > 0. 

Hint.  Note  that  for  some  n e w,  {X  e T \ m(X)  > ^}  is  uncountable.  H 

For  a measure  m over  a set  S, 

m is  X-additive  iff  for  any  y < X and  pairwise  disjoint 

{Xa  | ol  < y}  C V(S),  m({JaXa)  = . 

(Flere,  a transfinite  sum  of  reals  is  the  supremum  of  the  sums  of  finite  subcollec- 
tions.) Banach  saw  that  only  the  cardinality  of  the  set  S matters  in  his  problem, 
and  that  in  lieu  of  property  (iii)  one  might  as  well  require  a stronger  property. 


2.2  Exercise.  Suppose  that  k is  the  least  cardinal  such  that  there  is  a measure  over 
k.  Then  every  measure  over  k is  K-additive. 

Hint.  If  not,  let  m be  a measure  over  k,  y < k,  and  {Xa  \ a < y}  C V(k) 
pairwise  disjoint  such  that  m( [J^  Xa)  f Then  y > w and  there  are 

only  countably  many  a’s  with  m(Xa ) > 0 by  2.1.  Removing  these  through  the 
additivity  property  (iii),  assume  without  loss  of  generality  that  each  m{Xa)  = 0, 
yet  /ndjff  xa)  — r > 0.  Now  check  that  m:  V(y)  — > [0,  1]  defined  by 


m(Y)  — 


^Uo'er^o') 

r 


is  a measure  over  y,  contradicting  the  minimality  of  k. 
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For  k > a>, 

k is  real-valued  measurable  iff  there  is  a k -additive  measure  over  k. 

If  m is  such  a measure,  then  clearly  m(X)  = 0 whenever  |X|  < k.  Hence,  it  is 
easy  to  see  that  a real-valued  measurable  cardinal  is  regular.  Banach  [30:  101] 
established  under  GCH  that  every  real-valued  measurable  cardinal  is  weakly  in- 
accessible. 

As  a student  at  Lwow  Ulam  [29]  had  already  provided,  in  measure-theoretic 
terms,  the  first  construction  of  an  ultrafilter  over  a>  using  a well-ordering  of  Vico). 
(Tarski  [29]  announced  the  general  result  that  any  filter  over  a set  can  be  extended 
to  an  ultrafilter  over  that  set.)  In  his  doctoral  dissertation  Ulam  then  established 
fundamental  results  concerning  Banach’s  measure  problem;  as  we  shall  see,  this 
work  involved  a direct  generalization  of  an  ultrafilter  over  &>.  Ulam  first  removed 
GCH  from  Banach’s  result  above;  for  this  purpose,  he  devised  a useful  combina- 
torial device  now  known  as  an  Ulam  matrix: 

2.3  Proposition  (Ulam  [30]).  For  any  X,  there  is  a collection  of  sets 
{A|  | a < X+  A | < k]  C V(X+)  satisfying 

(a)  A|  fl  A|  = 0 whenever  a < f < X+  and  f < X;  and 

(b)  |k+  — |J|<X  A||  < X for  each  a < X+ . 

Proof  For  each  77  < X+  let  fn:  X —>  ;/  + 1 be  surjective,  and  for  a < X+  and 
§ < X set  A|  = { 77  < X+  | f^(f)  — a}.  Then  (a)  is  immediate  and  for  (b)  note 
that  (A  + - |Jf<A.  A|)  c a.  H 

2.4  Corollary.  If  k is  real-valued  measurable,  then  k is  weakly  inaccessible. 

Proof.  Since  k must  be  regular,  it  remains  to  establish  that  k is  a limit  cardinal. 
Assume  to  the  contrary  that  k — k+ , and  consider  an  Ulam  matrix  as  in  2.3.  Let 
m be  a /c-additive  measure  over  k.  Then  for  each  a < X+,  there  is  a < X such 
that  m(A|“)  > 0 by  2.3(b).  But  then,  there  must  be  a fixed  f < X such  that  = £ 
for  k+  many  a’s.  By  2.3(a),  this  contradicts  2.1.  H 

Ulam  then  pointed  out  a major  dichotomy.  For  a measure  m over  k, 

A C k is  an  atom  for  m iff  m(A ) > 0 yet  for  any  B C A, 

m(B ) = m(A)  or  /77(B)  = 0 , and 
m is  atomless  iff  there  are  no  atoms  for  m . 

Ulam  drew  important  conclusions  both  from  the  existence  of  an  atomless  measure 
and  from  the  existence  of  a measure  with  an  atom,  (b)  of  the  following  does  not 
need  much  beyond  Ulam’s  proof  of  (a). 

2.5  Theorem  (Ulam  [30]).  Suppose  that  there  is  an  atomless  K-additive  measure 
///  over  k.  Then: 

(a)  k < 2K°. 

(b)  There  is  a measure  over  the  reals  extending  Lebesgue  measure. 
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2.6  Lemma. 

(i)  For  any  f el  with  e > 0 and  X C.  k with  m(X ) > 0,  there  is  a Y C X 
satisfying  0 < m(Y)  < e. 

(ii)  For  any  X C k there  is  a Y C X satisfying  m{Y)  — \ ■ m(X). 

Proof,  (i)  It  suffices  to  recursively  define  a c-descending  sequence  of  sets  Xt 
satisfying  0 < m(Xi+\ ) < j ■ m(Xj ) for  each  i e co.  But  given  Xt,  since  m 
is  atomless,  there  is  a partition  A U B — X,  such  that  0 < m(A)  < m(B);  set 
Xi+i  = A. 

(ii)  Recursively  define  a c-descending  sequence  of  sets  Xa  such  that  m(Xa ) > 
\ ■ m(X)  for  as  long  as  possible,  as  follows:  Set  Xo  = X.  If  Xa  has  been  de- 
fined, define  Xa+\  exactly  when  m(Xa)  > \ ■ m(X),  in  which  case  it  is  to  satisfy 
Xa+i  ^ Xa  and  m(Xa)  > m(Xa+\)  > \ ■ m(X).  This  is  possible  by  (i).  Finally, 
for  S a limit  ordinal  define  Xs  = Ho!<6  Xa  exactly  when  Xa  has  been  defined  for 
each  a < 8.  There  must  now  be  an  a < a>\  such  that  m(Xa ) = ^ • rn(X),  else  the 
collection  {Xa  — Xa+i  | a < an}  would  contradict  2.1.  H 

Proof  of  2.5.  For  each  s e <wco  define  sets  Xs  C k as  follows:  Set  Xtg  = k.  Given 
Xs,  apply  2.6(ii)  recursively  to  get  sets  Xs^^  for  i e co  such  that 

xs  = Uieo,  xs'  (i)  is  a disjoint  union  , 


and 

m{Xm)  = 2-(i+1)  • m(Xs)  . 

For  each  / e “’co  set  Yf  = fjn  Xf\n,  so  that  m(Yf ) = 0.  k = (J {Yf  \ f e wco}, 
and  so  m cannot  be  (2K°)+ -additive,  and  hence  (a)  follows. 

For  (b)  define  /x:  V(wco)  ->  [0,  1]  by 

M(A)  = m(U{f>  | fe  A}). 

It  is  simple  to  check  that  /x  is  a (in  fact  /r-additive)  measure  over  coa>.  It  remains 
to  use  properties  of  the  Lebesgue  measure  mL  as  formulated  in  §0  to  show  that  \i 
extends  mi.  By  the  construction  of  Xs  for  .v  e ''"’a), 

HiO(s))  = m(Xs)  = n,<|S|2-(i(')+1)  = mL(0(s))  . 

Thus,  by  definition  of  mL,  yu  and  mL  agree  on  the  Borel  sets.  But  then,  /i ( N ) = 0 
for  any  null  set  N,  and  consequently  //  and  mL  agree  on  the  Lebesgue  measurable 
sets.  H 


This  result  provided  for  the  first  time  a well-motivated  example  of  a weakly 
inaccessible  cardinal  < 2^°,  and  hence  a plausible  hypothesis  implying  a drastic 
failure  of  Cantor’s  Continuum  Hypothesis.  Much  stronger  results  are  now  known 
along  these  lines,  but  almost  forty  years  were  to  pass  before  the  primary  results 
on  real-valued  measurability  < 2K°  were  established  by  Solovay  (§§16, 17).  For 


26 


Chapter  1 . Beginnings 


a portmanteau  compendium  of  recent  results  on  real-valued  measurability,  see 
Fremlin  [93]. 

The  other  road  taken  by  Ulam  was  to  be  even  more  consequential.  Suppose 
that  a k -additive  measure  m over  k does  have  an  atom  A c k . If  p is  defined  on 
VO <)  by 


lx(X)  = 


m(X  (T  A ) 
m (A) 


then  /x  is  a k -additive  measure  over  k with  range  [0,  1}.  This  property  can  be 
conveniently  expressed  in  terms  of  ultrafilters:  For  such  a two-valued  measure  /x, 
set: 

= {X  c K | n(X)  = 1}  . 


Then  Ufl  is  a (non-principal)  ultrafilter  over  k.  For  instance,  if  X.Y  e U^, 
then  Xnf  e Ufl : otherwise  X — Y e and  Y — X e Utl,  leading  to  the 
contradiction  fi(X  U Y)  > 2.  Furthermore,  the  /c -additivity  of  /x  translates  to  a 
strong  intersection  property:  Recall  that  a filter  F is  X-complete  iff  for  any  y < X 
and  {Xa  | a < y}  c F,  na<y  Xa  e F.  There  is  a simple  dual  characterization 
for  ultrafilters: 


2.7  Exercise.  An  ultrafilter  U is  X-complete  iff for  any  y < X and 

UfTa,  | a < y]  e U,  there  is  an  a < y such  that  Ya  e U . ~\ 

Flence,  our  f/M  is  /c-complete.  This  leads  to  the  most  important  concept  of 
all  large  cardinal  theory:  For  k > co, 

k is  measurable  iff  there  is  a k -complete  ultrafilter  over  k . 

Measurability  is  a direct  generalization  of  the  existence  of  ultrafilters  over  a>  (which 
of  course  are  &>-complete).  The  finite  intersection  property  is  automatically  pre- 
served when  taking  the  union  of  a chain  of  filters,  so  that  just  a maximal  principle 
is  needed  to  get  ultrafilters  over  co.  But  a: -completeness  for  a:  > co  is  not  similarly 
preserved,  so  that  the  existence  of  a /r-complete  ultrafilter  over  k must  be  explic- 
itly postulated.  Tarski  [39,45]  investigated  the  concept  of  measurability  in  the 
context  of  Boolean  algebras  and  a: -complete  prime  ideals  - just  the  dual  notion. 
The  following  result  stands  in  contrast  to  2.5(a): 

2.8  Theorem  (Ulam;  Tarski  - Ulam  [30:  146]).  If  k is  measurable,  then  k is 
inaccessible. 

Proof.  What  remains  beyond  2.4  is  to  establish  that  a:  is  a strong  limit.  Suppose 
that  U is  a a: -complete  ultrafilter  over  k,  and  assume  to  the  contrary  that  X < k, 

yet  there  is  an  injective  function  f\ic—>  x2.  For  each  a < X,  there  is  an  ia  < 2 

such  that  Xa  — {§  < k \ /(^)(a)  = ia}  e U.  Flence,  X — ["]«<;,  ^<*  e U,  and 

for  § e X,  f(f){a)  = ia  for  every  a < X.  But  then,  X can  have  at  most  one 

member,  which  is  a contradiction.  H 
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Whether  the  least  measurable  cardinal  is  strictly  larger  than  the  least  inac- 
cessible cardinal  became  a focal  question,  and  was  settled  only  thirty  years  later. 
For  a result  that  by  present-day  standards  is  rather  straightforward  this  may  seem 
like  a remarkably  long  time  - even  taking  into  account  the  convulsive  events  that 
took  place  in  Europe.  However,  this  is  a typical  case  of  a long-standing  math- 
ematical problem  suddenly  solved  in  the  wake  of  new  techniques  - and  perhaps 
surprisingly,  sufficiently  strong  methods  were  first  to  emerge  in  the  semantics  of 
infinitary  languages  (§4). 

Meanwhile,  the  center  stage  in  set  theory  was  taken  by  Godel’s  formula- 
tion of  the  constructible  hierarchy  and  the  consistency  of  the  Axiom  of  Choice 
and  the  Generalized  Continuum  Hypothesis  (§3).  Nonetheless,  in  the  growing 
abstraction  of  modern  mathematics  measurability  occurred  in  various  fields  as  a 
limitative  concept:  Mackey  [44],  vector  lattices;  Hewitt  [48],  rings  of  continuous 
functions;  Ehrenfeucht-Los  [54],  infinite  cyclic  groups;  and  further  references  in 
Keisler-Tarski  [64: 272],  It  remained  for  Scott  to  establish  the  modem  and  central 
relevance  of  measurable  cardinals  in  connection  with  the  ultrapower  construc- 
tion (§5). 
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3.  Constructibility 

This  section  does  not  discuss  a new  large  cardinal  concept,  but  rather  the  major 
contribution  to  set  theory  of  the  foremost  mathematical  logician  of  our  time  and  his 
speculations  about  large  cardinals.  That  contribution  not  only  launched  axiomatic 
set  theory  as  a distinctive  field  of  mathematics,  but  stimulated  subsequent  work  in 
large  cardinals  first  in  complementary  reaction  and  then  in  structural  generalization. 

In  October  of  1935  Godel  informed  von  Neumann  at  the  Institute  for  Ad- 
vanced Study  in  Princeton  that  he  had  established  the  relative  consistency  of  the 
Axiom  of  Choice.  This  he  did,  of  course,  by  devising  his  constructible  hierarchy  L 
(for  “law”)  and  verifying  the  axiom  there.  Godel  conjectured  that  the  Continuum 
Hypothesis  would  also  hold  in  L,  but  he  soon  fell  ill  and  only  gave  a proof  of  this 
and  GCH  two  years  later  (the  crucial  idea  apparently  came  to  him  during  the  night 
of  June  14-15,  1937  - Godel  [86:40]).  In  addition  to  these  fundamental  results 
he  observed  that  in  L there  are  delimitative  counterexamples  for  descriptive  set 
theory  (§§12,13). 

Godel’s  article  [38]  in  the  Proceedings  of  the  National  Academy  of  Sciences 
U.S.A.  was  the  first  announcement  of  these  results,  and  the  succeeding  [39]  pro- 
vided more  details  in  the  context  of  ZF.  To  review,  a set  y is  definable  over 
a structure  M iff  there  is  a first-order  formula  (p( vq,  . . . , vn)  in  the  language 
of  M and  parameters  a\,...,an  in  the  domain  of  M such  that:  z e y iff 
M |=  (p[z,  a i, . . . , a,,].  For  any  set  x, 

def(x)  = {y  c x \ y is  definable  over  (x,  e) } . 

This  is  a set,  being  itself  a subset  of  V(x)  definable  via  the  formalized  satisfaction 
relation  for  (x,  e).  Now  define  by  transfinite  recursion: 


L0  = 0 ; La+ 1 = def(L„)  ; Ls  = [J a<sLa  for  limit  S > 0 ; 


and 

L — • 

Thus,  L is  a (definable)  class,  the  class  of  constructible  sets,  and  the  assertion 
V — L,  i.e.  Vx(x  e L),  is  the  Axiom  of  Constructibility . It  is  already  apparent  why 
the  Axiom  of  Choice  would  hold  in  L : By  transfinite  recursion  within  L one  can 
well-order  L level-by-level,  well  ordering  La+ \ — La  according  to  definitions  and 
the  previous  well-ordering  of  the  parameters  from  La.  The  verification  of  GCH 
was  less  obvious,  evidently  inspired  by  the  analysis  of  Skolem  [23],  the  source 
of  the  Lowenheim-Skolem  Theorem.  Devlin  [84],  Kunen  [80],  Moschovakis  [80] 
and  Drake  [74]  provide  expositions  of  Godel’s  results. 

Godel  believed  that  careful  epistemological  analysis  can  lead  to  significant 
mathematical  advances,  and  his  remarkable  successes  amply  bear  out  this  con- 
tention. His  results  with  L actually  represent  a steady  intellectual  development 
from  his  celebrated  Incompleteness  Theorem  [31]  which  was  to  extend  later  to 
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speculations  on  large  cardinals.  Already  in  footnote  48a  to  [31],  made  much  of 
by  Kreisel  [80],  Godel  wrote: 


. . . the  true  reason  for  the  incompleteness  inherent  in  all  formal  systems  of  mathematics 
is  that  the  formation  of  ever  higher  types  can  be  continued  into  the  transfinite  . . . while  in 
any  formal  system  at  most  denumerably  many  of  them  are  available.  For  it  can  be  shown 
that  the  undecidable  propositions  constructed  here  become  decidable  whenever  appropriate 
higher  types  are  added  (for  example,  the  type  co  to  the  system  P [Peano  Arithmetic]).  An 
analogous  situation  prevails  for  the  axiom  system  of  set  theory. 


Godel’s  first  description  of  L in  [38]  shows  how  he  had  built  on  this  insight: 


This  model,  roughly  speaking,  consists  of  all  “mathematically  constructible”  sets,  where 
the  term  “constructible”  is  to  be  understood  in  the  semiintuitionistic  sense  which  excludes 
impredicative  procedures.  This  means  “constructible”  sets  are  defined  to  be  those  sets 
which  can  be  obtained  by  Russell’s  ramified  hierarchy  of  types,  if  extended  to  include  the 
transfinite  orders.  The  extension  to  transfinite  orders  has  the  consequence  that  the  model 
satisfies  the  impredicative  axioms  of  set  theory,  because  an  axiom  of  reducibility  can  be 
proved  for  sufficiently  high  orders. 


Godel  thus  regarded  his  hierarchy  as  a transfinite  extension  of  Russell’s,  which 
can  be  construed  as  a formal  theory  of  L0J+m.  Although  this  is  a hierarchy  of  pred- 
icative definitions  (i.e.  quantifying  over  domains  of  previously  formed  objects), 
Gddcl's  realization  was  that  extending  the  indexing  of  higher  types  through  all  the 
ordinals  would  lead  to  a completion  of  the  axioms  of  set  theory,  as  adumbrated 
in  that  footnote  to  [31].  Godel  stressed  the  historical  continuity:  the  allusion  to 
Russell’s  ill-fated  Axiom  of  Reducibility  is  a clear  reference  to  the  rectification:  if 
x e L\  and  X is  a cardinal  in  L,  then  for  any  y c x in  L there  is  a y < X such  that 
y e Ly.  Thus,  the  impredicative  power  set  operation  is  tamed  in  L,  leading  to  the 
consistency  of  GCH.  In  later  commentary  on  Russell’s  mathematical  logic  Godel 
[44: 147]  argued  that  with  his  “transfinite  theorem  of  reducibility”,  “all  impred- 
icativities  are  reduced  to  one  special  kind,  namely  the  existence  of  certain  large 
ordinal  numbers  (or  well-ordered  sets)  and  the  validity  of  recursive  reasoning  for 
them.” 

Interestingly  enough,  Gddel  viewed  L as  an  outright  construction  using  trans- 
finite reasoning  in  metamathematics.  See  Wang  [74:  8ff];  in  a letter  quoted  there 
Gddel  wrote  about  his  CH  result:  “. . . there  was  a special  obstacle  which  really 
made  it  practically  impossible  for  constructivists  to  discover  my  consistency  proof. 
It  is  the  fact  that  the  ramified  hierarchy,  which  had  been  invented  expressly  for 
constructive  purposes,  had  to  be  used  in  an  entirely  nonconstructive  way.”  Gddel 
was  aware  of  Zermelo’s  [30],  and  there  is  an  affinity  of  sorts  in  the  direct  use  of 
infinitary  methods  and  the  positing  of  successive  domains.  To  be  sure,  Zermelo 
did  not  formalize  his  logic,  while  Gddel  was  led  to  transfinite  types  by  his  inves- 
tigation of  formal  systems.  In  any  formalization  of  L the  extent  of  the  ordinals 
as  sustained  by  Replacement  has  to  be  accommodated,  and  significantly,  the  main 
statement  of  formal  consistency  about  ZF  in  Gddel  [39]  appealed  to  what  Zermelo 
had  called  “Grenzzahlen”:  If  k is  inaccessible,  then  LK  (=  ZFC  + GCH.  k need 
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only  be  weakly  inaccessible  here  because  of  the  proof  of  GCH  (Firestone-Rosser 
[49]). 

In  his  monograph  [40],  based  on  lectures  given  at  the  Institute  for  Advanced 
Study  during  the  winter  of  1938-39,  Gddel  gave  another  presentation  of  L.  This 
time  he  generated  L set  by  set  with  a transfinite  recursion  in  terms  of  eight 
elementary  set  generators,  a sort  of  Godel  numbering  into  the  transfinite.  These 
generators  were  based  on  a finite  second-order  axiomatization  of  the  Separation 
Schema  by  Paul  Bernays  [37],  which  in  turn  had  an  antecedent  in  a functional 
formulation  in  von  Neumann  [25].  Providing  a rigorous  formalization  of  his 
metamathematical  construction  Godel  now  emphasized  how  it  yields  an  “inner 
model”  and  finitary  relative  consistency  proofs.  In  particular,  an  external  appeal  to 
an  inaccessible  cardinal  was  no  longer  necessary.  Moreover,  the  new  presentation 
highlighted  the  stark  contrast  between  the  elementary  set  formation  processes  and 
the  extent  of  the  ordinals. 

Ironically,  the  development  of  set  theory  in  these  formative  years  may  have 
been  ill-served  by  [40].  Bearing  the  burden  of  authority  it  overshadowed  [39]  and 
obscured  its  model-theoretic  approach,  especially  in  the  Skolem  hull  argument 
for  the  consistency  of  GCH.  Only  decades  later  was  the  sort  of  fine  analysis  of 
[40]  to  become  an  appropriate  framework:  Godel  himself  considered  L only  as 
a contrivance  for  establishing  consistency  results,  but  in  the  mid- 1960’s  Ronald 
Jensen  [72]  developed  a “fine  structure”  theory  for  L of  intrinsic  interest,  and  the 
study  of  constructibility  and  its  generalizations  has  become  one  of  the  mainstreams 
of  modem  set  theory. 

Expanding  on  his  vision  of  completability  of  formal  systems  using  higher 
types  Godel  speculated  in  the  1940’s  about  the  possibility  of  deciding  proposi- 
tions with  large  cardinal  hypotheses,  particularly  with  respect  to  the  Continuum 
Problem.  In  a lecture  at  Princeton  University  in  December  of  1946,  Godel  re- 
marked [90: 151]): 


In  set  theory,  e.g.  the  successive  extensions  can  most  conveniently  be  represented  by 
stronger  and  stronger  axioms  of  infinity.  It  is  certainly  impossible  to  give  a combinatorial 
and  decidable  characterization  of  what  an  axiom  of  infinity  is  but  there  might  exist,  e.g.  a 
characterization  of  the  following  sort:  An  axiom  of  infinity  is  a proposition  which  has 
a certain  (decidable)  formal  structure  and  which  in  addition  is  true.  Such  a concept  of 
demonstrability  might  have  the  required  closure  property,  i.e.  the  following  could  be  true: 
Any  proof  for  a set-theoretic  theorem  in  the  next  higher  system  above  set  theory  ...  is 
replaceable  by  a proof  from  such  an  axiom  of  infinity.  It  is  not  impossible  that  for  such  a 
concept  of  demonstrability  some  completeness  theorem  would  hold  which  would  say  that 
every  proposition  expressible  in  set  theory  is  decidable  from  the  present  axioms  plus  some 
true  assertion  about  the  largeness  of  the  universe  of  all  sets. 


This  is  a remarkably  optimistic  statement  about  the  possibility  of  discovering 
new  “true”  axioms  that  will  decide  eveiy  set-theoretic  proposition.  Note  that  the 
following  reflection  argument  for  motivating  large  cardinals  is  almost  explicit:  (a) 
any  set-theoretic  proposition  can  be  established  in  the  “next  higher  system  above 
set  theory”,  say  if  the  satisfaction  relation  for  V were  available;  and  (b)  this  use  of 
the  relation  may  in  particular  cases  be  replaceable  by  a strong  axiom  of  infinity, 
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say  with  a corresponding  large  cardinal  playing  the  role  of  On.  Godel  went  on 
to  apply  his  reflection  argument  to  formulate  the  concept  of  ordinal  definability 
(see  Godel  [90: 146]).  The  Reflection  Principle  (see  after  6.2)  later  schematized  a 
basic  argument  that  can  be  carried  out  in  ZF. 

In  an  accentuated  form  reflection  came  to  be  the  main  heuristic  advanced  for 
motivating  various  large  cardinals:  Largeness  properties  ascribable  to  On  confront 
the  antithetical  contention  that  it  is  essentially  incomprehendable,  recalling  Can- 
tor’s Absolute.  This  inability  to  characterize  On  in  its  open-endedness  then  fosters 
the  synthetic  move  to  a large  cardinal  at  which  such  properties  obtain. 

The  Continuum  Problem  was  the  focus  of  an  expository  article  [47]  by  Godel 
that  included  rare  statements  concerning  his  realist  conception  of  mathematics.  He 
assumed  that  the  Continuum  Hypothesis  would  be  shown  independent  from  ZF, 
and  argued  moreover  that  it  must  be  false  according  to  certain  a priori  intuitions. 
Regarding  formal  axioms  that  might  resolve  such  questions  he  speculated  about 
large  cardinals  in  more  concrete  terms  than  in  his  1946  remarks.  Citing  Zermelo 
[30]  and  echoing  its  theme  Godel  argued  on  the  basis  of  the  cumulative  hierarchy 
for  new  axioms  “which  assert  the  existence  of  still  further  iterations  of  the  opera- 
tion ‘set  of  ”,  giving  inaccessible  and  Mahlo  cardinals  as  examples.  However,  he 
mentioned  that  cardinals  like  these  relativize  to  L (3.1)  and  hence  cannot  imply 
the  failure  of  CH. 

Gddel  went  on  to  speculate  ([47:  521]): 


. . . even  disregarding  the  intrinsic  necessity  of  some  new  axiom,  and  even  in  case  it  had 
no  intrinsic  necessity  at  all,  a decision  about  its  truth  is  possible  also  in  another  way, 
namely,  inductively  by  studying  its  “success”,  that  is,  its  fruitfulness  in  consequences  and 
in  particular  in  “verifiable”  consequences,  i.e.  consequences  demonstrable  without  the  new 
axiom,  whose  proofs  by  means  of  the  new  axiom,  however,  are  considerably  simpler  and 
easier  to  discover,  and  make  it  possible  to  condense  into  one  proof  many  different  proofs 
. . . There  might  exist  axioms  so  abundant  in  their  verifiable  consequences,  shedding  so 
much  light  upon  a whole  discipline,  and  furnishing  such  powerful  methods  for  solving 
given  problems  (and  even  solving  them,  as  far  as  that  is  possible,  in  a constructivistic 
way)  that  quite  irrespective  of  their  intrinsic  necessity  they  would  have  to  be  assumed  at 
least  in  the  same  sense  as  any  well  established  physical  theory. 


This  interestingly  undercuts  an  avowedly  realist  position  with  a pragmatism  that 
dilutes  the  force  of  “truth”,  but  is  resonant  with  subsequent  investigations,  par- 
ticularly of  the  Axiom  of  Determinacy  (Chapter  6).  See  Wang  [74:  200ff]  for 
more  on  Gddel’s  views  on  intrinsic  necessity  and  pragmatic  success  for  accepting 
new  axioms.  Godel  concluded  his  article  with  some  controversial  remarks  on 
mathematical  evidence  against  CH  (cf.  Martin-Solovay  [70: 176],  Martin  [76]). 

Whatever  can  be  said  about  Godel’s  proposals  and  despite  latter-day  refer- 
ences to  “Godel’s  program”,  it  is  unclear  how  much  actual  influence  they  had  on 
subsequent  developments.  True,  Tarski  who  was  to  make  an  important  contribu- 
tion in  1960  (§4)  heard  Godel’s  1946  Princeton  lecture,  but  a text  did  not  become 
generally  accessible  until  1965  in  Godel  [65].  The  article  [47]  was  widely  read, 
but  it  was  addressed  to  non-specialists.  And  nowhere  have  Godel’s  remarks  been 
acknowledged  as  having  been  an  inspiration.  In  the  1960’s  the  theory  of  large 
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cardinals  quickly  developed  a self-fueling  momentum,  and  blossomed  into  a so- 
phisticated branch  of  set  theory  far  overshadowing  Godel’s  epistemological  con- 
cerns. 

In  an  unpublished  footnote  20  toward  a 1966  revision  of  his  [47]  Godel  was 
to  acknowledge  ([90: 260ff])  the  new  developments,  matters  taken  up  in  §§4,5: 


In  recent  years  great  progress  has  been  made  in  the  area  of  axioms  of  infinity.  In 
particular,  some  propositions  have  been  formulated  which,  if  consistent,  are  extremely 
strong  axioms  of  infinity  of  an  entirely  new  kind  . . . Dana  Scott  . . . has  proved  that  one 
of  them  implies  the  existence  of  non-constructible  sets.  That  these  axioms  are  implied  by 
the  general  concept  of  set  in  the  same  sense  as  Mahlo’s  has  not  been  made  clear  yet  . . . 
However,  they  are  supported  by  strong  arguments  from  analogy,  e.g.,  by  the  fact  that  they 
follow  from  the  existence  of  generalizations  of  Stone’s  representation  theorem  to  Boolean 
algebras  with  operations  on  infinitely  many  elements. 


This  last  presumably  refers  to  strong  compactness  (see  4.1).  The  heuristic  of 
generalization  from  Ko,  like  reflection,  also  came  to  be  used  to  motivate  various 
large  cardinals.  Recalling  Cantor’s  unitary  view  of  the  finite  and  the  transfinite, 
large  cardinal  properties  satisfied  by  K0  would  be  too  accidental  if  they  were  not 
also  ascribable  to  higher  cardinals  in  an  eternal  recurrence. 

It  is  now  known  that  in  1942  Godel  had  developed  partial  results  toward  the 
independence  of  the  Axiom  of  Choice,  and  that  he  soon  abandoned  this  work  (see 
Moore  [88: 149-151]).  According  to  Kreisel  [80:201],  “With  present  experience 
it  is  not  too  difficult  to  complete  the  proof.  But  something  essential  - in  Godel’s 
words  (in  conversation):  a method  - had  been  missing  ...”  According  to  John 
Addison  (Moore  [88: 150]),  Gddel  feared  that  his  proof  would  lead  set-theoretic 
research  in  the  wrong  direction.  Rather  than  developing  relative  consistency  re- 
sults, the  concept  of  set  should  be  analyzed  more  carefully  and  new  axioms  sought 
that  would  simply  settle  issues  like  the  Continuum  Hypothesis.  Nonetheless,  in  a 
revised  postscript  toward  a 1966  revision  of  his  [47]  Godel  was  to  declare  (Godel 
[90:  270])  that  Paul  Cohen’s  work  on  the  independence  of  CH  “. . . is  the  greatest 
advance  in  the  foundations  of  set  theory  since  its  axiomatization”,  noting  that  it 
showed  in  particular  that  large  cardinals  have  no  direct  bearing  on  CH  (10.12ff). 

Whatever  the  subsequent  developments,  Godel’s  construction  of  L had  estab- 
lished the  minimum  possibility  for  the  set-theoretic  universe,  and  large  cardinals 
were  to  provide  the  counterweight  first  in  reaction  and  then  in  generalization. 
Scott’s  result  that  measurable  cardinals  contradict  V — L (§5)  inspired  research 
that  was  to  establish  the  intrinsic  necessity  of  large  cardinals  for  transcending  such 
hypotheses.  The  generalizations  of  constructibility  accommodating  measurability 
(§§20,21)  led  to  a full-blown  theory  of  minimal  models  for  large  cardinal  hy- 
potheses. And  throughout,  a crescendo  of  results  was  to  amply  demonstrate  the 
pragmatic  success  of  large  cardinals  in  settling  a large  variety  of  questions,  many 
about  definable  sets  of  reals.  In  these  more  subtle  ways  Godel’s  hopes  about  large 
cardinals  have  been  vindicated. 
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Inner  Models 

The  structural  generalizations  of  Godel’s  L began  with  abstractions,  relativizations, 
and  some  conditional  independence  results.  With  largely  proof-theoretic  aims  in 
mind  Shepherdson  [51,52,53]  carefully  formalized  and  studied  a general  notion 
of  “inner  model”.  The  term  will  be  reserved  for  a special  case:  For  a proper 
class  M, 

M is  an  inner  model  iff  M is  a transitive  e-model  of  ZF  with  On  c M . 

Such  M modeling  ZFC  are  specified  by  M is  an  inner  model  of  ZFC,  and  so  forth. 
These  notions  can  be  formalized  for  arbitrary  classes  M in  a theory  like  Morse- 
Kelley  where  the  class  satisfaction  relation  is  definable,  but  some  clarification  is 
necessary  for  formalization  in  ZF: 

(i)  By  “class”  in  the  ZFC  context  is  meant  definable  class,  i.e.  there  is  a 
formula  ip(v o, ...  ,v„)  and  sets  a i,  . . . , an  such  that  x e M is  merely  une  fagon 
de parler  for  ip[x , a\, . . . , a,,]. 

(ii)  Let  Inn(i>)  be  the  formula 

|Jt>  C v A Va(def(i>  fl  Va)  C v)  . 

Inn(u)  easily  implies  that  On  C v by  induction  on  ordinals  and  that  [t>  fl  Va  \ 
a e On}  C v.  Using  this  relativized  rank  hierarchy,  it  follows  as  for  L that  for 
any  axiom  a of  ZF,  with  oM  the  relativization  of  a to  M 

(*)  1~zf  Inn (M)  -*  oM 

(cf.  Devlin  [84:  60-63]).  Moreover,  for  any  transitive  class  N with  On  C N 
satisfying  \~zt  ctn  for  every  axiom  o of  ZF,  it  can  be  shown  that  \~zf  Inn(lV). 

Since  all  instances  of  ZF  axiom  schema  are  needed  in  the  proofs  of  the 
theorems  (*),  (i)  and  (ii)  constitute  a formalization  in  a weak  sense.  Nonetheless, 
Inn (M)  Mill  be  what  is  meant  by  the  assertion  that  M is  an  inner  model,  and 
M \=  (p  will  often  be  written  for  the  more  proper  (pM . 

The  archetypical  inner  model  is  L,  and  L C M for  any  inner  model  M since 
Lm  = L.  Because  of  this  Shepherdson  [53]  observed  that  the  relative  consistency 
of  theories  like  ZFC  H — 'GCH  cannot  be  established  by  relativization  to  an  inner 
model.  Furthermore,  as  noted  by  Godel  [38:557]  (and  more  clearly  in  [51:69]), 
some  large  cardinals  relativize  to  inner  models: 

3.1  Exercise.  If  M is  an  inner  model  of  ZFC  and  k is  an  a-inaccessible  cardinal, 
then  ( k is  a-inaccessible)M.  The  analogous  assertion  holds  for  a-Mahlo  cardinals. 

Hint.  Show  that:  (a)  if  k is  a strong  limit,  then  a:  is  a strong  limit  in  the  sense  of 
M,  and  (b)  if  k is  regular,  then  k is  regular  in  the  sense  of  M.  Now  use  induction 


on  a. 
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Despite  these  inherent  limitations  Andras  Hajnal  [56,61]  and  Levy  [57,60] 
in  their  doctoral  dissertations  in  Hungary  and  Israel  respectively  developed  basic 
generalizations  of  L which  will  be  central  to  our  later  concerns  about  the  effect  of 
large  cardinals  on  the  inner  structures  of  set  theory.  Moschovakis  [80: 489ff,  53  Iff] 
and  Drake  [74:  149ff]  contain  careful  expositions  on  the  following  models,  of 
which  only  the  salient  features  are  noted: 

Hajnal  essentially  provided  for  a given  set  A the  constructible  closure  L(A), 
i.e.  the  smallest  inner  model  M such  that  A e M.  If  there  is  no  concern  about  how 
the  elements  of  A are  to  be  incorporated  into  L(A),  the  Zermelo  [30]  urelement 
basis  idea  can  be  used  with  the  transitive  closure  tc({  A})  to  ensure  that  the  resulting 
class  is  transitive: 

L0(A)  = tc([A})  ; La+l{A)  = def(L„(A))  ; 

La  (A)  = U«<«L«(A)  for  limit  S > 0 ; 


and 

L(A)  = UaLa(A)  . 

Although  L(A)  is  indeed  an  inner  model,  unless  tc({ A})  has  a well-ordering  in 
L(A),  L(A)  does  not  satisfy  the  Axiom  of  Choice.  |L„(A)|  = |tc([A})|  • \a\  for 
a > co,  a result  established  by  induction  on  a. 

Levy  on  the  other  hand  developed  for  a given  set  A the  inner  model  L[A] 
of  sets  constructible  relative  to  A,  i.e.  the  smallest  inner  model  M such  that  for 
every  x e M,  A fl  x e M.  The  idea  is  to  define  a relativized  hierarchy  where 
assertions  about  membership  in  A can  be  made  of  sets  defined  thus  far.  Let 

defA(x)  = [y  c x \ y is  definable  over  (x,  e,  A fl  x)}  , 
making  A fix  available  as  a unary  relation  for  definitions.  In  analogy  with  L, 
L0[A]  = 0 ; La+i[A]  = defA(L„[A])  ; LS[A]  — Ua<aL„[A]  for  limit  8 > 0 ; 


and 

L[A]  = U aLa[A]  . 

Unlike  for  L(A)  what  remains  of  A is  only  A fi  L[A]  e L[A],  so  that  for  example 
L[R]  = L for  the  reals  R.  However,  L[A]  is  more  constructive  since  knowledge 
of  A is  incorporated  through  the  hierarchy  of  definitions,  and  like  L,  L[A]  satisfies 
the  Axiom  of  Choice.  |L„  [A]|  = |a|  for  a > co,  a result  established  by  induction  on 
a.  The  theory  of  L[A]  is  best  developed  with  structures  (L„[A],  e,  AnLa[A]>  for 
Ce(A),  the  language  of  set  theory  augmented  by  one  unary  predicate  symbol  A. 

As  with  L itself  a prominent  feature  of  the  theory  is  the  absoluteness 
of  the  definitions  involved,  which  leads  to  the  following  (see  Moschovakis 
[80:515,532]): 
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3.2  Proposition. 

(a)  If  M is  an  inner  model  with  A e M,  then  L(A)M  — L(A). 

(b)  If  M is  an  inner  model  with  An  M e M,  then  L[A  n M]m  — L[A], 

(c)  If  A n L[A]  — B C\  L[A],  then  L[A]  = L[B], 

(d)  L[A]  = L[A(1  L[AJ]  — L(A  n L[A]). 

(e)  If  AC  L,  then  L[A ] = L(A).  H 

For  future  reference,  some  results  about  L and  L[A]  are  stated  that  be- 
gin the  finer  analysis  of  constmctibility  (see  Devlin  [84:71,75]  or  Moschovakis 
[80:496,516,533]). 

3.3  Theorem. 

(a)  There  is  a sentence  oq  of  such  that  for  any  transitive  class  N, 

( N , e)  1=  (Jo  iff  N — L v N — Lg  for  some  limit  8 > co  . 

Also,  there  is  a formula  <po(vo,  i>i)  of  Ce  that  defines  in  L a well-ordering  <l  of 
L such  that  for  any  limit  S > a>,  any  y e L$,  and  any  x, 

x <L  y iff  X e Ls  a { Ls , e)  |=  <p0[-n  y]  . 

(b)  There  is  a sentence  eri  of  C€(A)  where  A is  unary  such  that  for  any  A 
and  any  transitive  class  N, 

( N , e,  A n N)  \=  cri  iff  N — L[A]  v N — Lg\A\  for  some  limit  8 > co  . 

Also,  there  is  a formula  q>\  (vo,  n)  of  Ce  (A)  that  in  any  (L[A],  e,  AnL[A])  defines 
a well-ordering  <l[a]  of  T[A]  such  that  for  any  limit  8 > co,  any  y e Lg\A\  and 
any  x, 

x <L[a]  y iff  xeLg[A]  A (Lj[A],  e,  A n LS[A])  \=  <p\[x,  y]  . H 

The  sentence  er0  leads  directly  to  Godel’s  Condensation  Lemma,  the  crux  of 
his  proof  of  GCH  in  L : If  8 > co  is  a limit  ordinal  and  ( //,  e)  is  an  elementary 
substructure  of  (Ls,  e),  then  ( H , e)  has  a transitive  collapse  (by  0.4)  that  must  be 
of  form  (La,  e)  for  some  a (because  of  ao). 

Although  differing  in  their  fomial  presentations,  both  Hajnal  and  Levy  used 
a set  of  ordinals  A so  that  L[A]  = L(A)  by  3.2(e),  and  the  distinctions  were  to 
surface  only  later.  Hajnal  and  Levy  (as  well  as  Shoenfield  [59]  who  formulated  a 
special  version  of  Levy’s  construction)  used  these  models  to  establish  conditional 
independence  results  of  the  sort:  If  ->CH  is  consistent,  then  so  is  — CH  together 
with  2X  = X+  for  sufficiently  large  k.  Hajnal’s  finer  analysis  led  to  a useful  fact: 
If  V — L[A]  and  A C k+,  then  2K  = k+ . In  particular,  if  ->CH  and  A c a>2  codes 
co 2 distinct  subsets  of  co  as  well  as  injections:  a ->  co\  for  every  a < co 2,  then 
= o)2  and  so  (2K°  = 2K>  = <u2)i[A1.  More  pointedly,  if  2K°  f &)2  is  provable 
in  ZFC,  then  so  is  CH.  All  this  anticipated  the  expected  independence  of  CH,  and 
providing  at  least  a semblance  of  continuity  Cohen  duly  established  this  with  his 
celebrated  method  of  forcing. 
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4.  Compactness 

For  almost  three  decades  after  1930  no  significant  advance  was  made  in  the  in- 
vestigation of  large  cardinals,  but  Alfred  Tarski  maintained  a steady  interest  in 
the  subject.  He  visited  the  United  States  from  Poland  in  1939,  but  the  outbreak 
of  war  precluded  his  return,  and  by  1942  he  was  established  at  the  University 
of  California  at  Berkeley.  Through  his  initiatives  he  was  to  play  a pivotal  role 
in  the  flowering  of  mathematical  logic  in  California,  and  Berkeley  became  the 
leading  center  for  set  theory  in  the  1960’s.  In  particular,  he  and  his  co-workers 
were  to  make  basic  contributions  to  the  theory  of  large  cardinals  through  the  infu- 
sion of  model-theoretic  methods.  This  section  describes  the  early  stages  of  these 
developments  and  brings  into  full  play  the  model-theoretic  preliminaries  of  §0. 

Combinatorial  elaborations  had  already  been  suggested  in  the  early  paper 
Erdos-Tarski  [43]  which  at  the  end  described  various  properties  of  cardinals  im- 
plying inaccessibility  (see  §7).  The  details  of  implications  asserted  there  were 
presented  in  an  influential  seminar  conducted  by  Tarski  and  Andrzej  Mostowski 
at  Berkeley  in  1958-9,  and  soon  appeared  in  Erdos-Tarski  [61].  It  was  against 
this  backdrop  that  Tarski’s  initiatives  in  another  direction  were  to  lead  to  a real 
breakthrough. 

Tarski  [58]  considered  the  semantics  of  the  infinitary  predicate  languages  Lx M 
and  later  raised  the  issue  of  their  possible  compactness.  In  brief,  an  LX[l  language 
is  formulated  as  follows:  Proceeding  as  for  the  usual  first-order  logic,  first  specify 
a supply  of  non-logical  symbols:  (finitary)  predicate,  function,  and  constant  sym- 
bols. These  together  with  an  allowed  supply  of  max([k,  /i})  many  variables  lead 
to  the  terms  and  atomic  formulas.  Then  the  usual  formula  generating  rules  are 
expanded  to  allow  conjunctions  /\^<a  and  disjunctions  V?<c  °f  a formulas  for 
any  a < X,  and  universal  quantifications  and  existential  quantifications  3^</j 
of  ji  variables  for  any  f < fi.  Finally,  a formula  is  an  expression  so  generated 
with  less  than  ji  free  variables,  this  to  allow  the  possibility  of  quantificational  clo- 
sure. Structures  for  interpreting  the  language  are  as  for  first-order  logic,  and  the 
satisfaction  relation  is  extended  to  incorporate  the  new  infinitary  connectives  and 
quantifiers  in  the  expected  way.  (For  book  studies  of  these  languages  see  Karp 
[64]  for  the  formal  syntax,  Keisler  [71]  for  the  model  theory  of  Lmw,  and  Dick- 
mann  [75]  for  the  general  model  theory;  Dickmann  [85]  provides  an  overview.) 
In  what  follows,  properties  of  LXfl  logic  that  are  straightforward  generalizations 
of  those  for  the  usual  first-order,  i.e.  Lm0J,  logic  are  taken  for  granted.  In  subse- 
quent sections  the  concepts  introduced  will  be  applied  in  terms  of  combinatorial 
characterizations  and  the  results  established  considerably  improved. 

Tarski  [62]  formulated  two  natural  generalizations  of  the  well-known  com- 
pactness property  of  L0XJJ:  A collection  of  LXfl  sentences  is  satisfiable  iff  it  has  a 
model  under  the  expected  interpretation  of  infinitary  conjunction,  disjunction  and 
quantification;  and  is  v-satisficible  iff  every  subcollection  of  cardinality  less  than 
v is  satisfiable.  For  k > a>. 
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k is  strongly  compact  iff  any  collection  of  LKK  sentences, 

if  /c-satisfiable,  is  satisfiable  . 

k is  weakly  compact  iff  any  collection  of  LKK  sentences  using 
at  most  k non-logical  symbols, 
if  /c-satisfiable,  is  satisfiable  . 

Tarski’s  original  formulation  of  weak  compactness  had  the  more  stringent  condi- 
tion \X\  = k and  does  not  imply  the  inaccessibility  of  /c  (Boos  [76]),  while  the 
one  that  is  adopted  here  does  (4.4)  which  is  the  modem  preference. 

By  the  early  1960’s  the  development  of  the  basic  ultraproduct  construction 
led  to  a surge  of  new  results  in  model  theory  and  an  enduring  interest  in  ultra- 
filters. The  general  construction  was  introduced  by  Jerzy  Los  [55],  where  the 
basic  theorem  0.6  is  implicit.  Then  Frayne-Morel-Scott  [62]  and  also  Kochen 
[61]  propagated  the  basic  theory  as  developed  by  the  authors  and  Tarski.  The  for- 
mer paper  provided  an  ultraproduct  proof  of  the  Compactness  Theorem,  a direct 
generalization  of  which  is  used  in  the  following  characterization. 

4.1  Proposition  (Keisler-Tarski  [64]).  k is  strongly  compact  iff  for  any  set  S, 
every  K-complete  filter  over  S can  be  extended  to  a K-complete  ultrafilter  over  S. 

Proof.  Suppose  first  that  /c  is  strongly  compact  and  F is  a /c-complete  filter 
over  a set  S.  Using  constants  X for  every  X c S,  let  E be  the  LKK  theory  of 
( S U V(S),  e,  X)Xqs  together  with  the  sentences  c e X for  every  X e F,  where 
c is  a new  constant.  X is  /c-satisfiable  as  F is  /c-complete,  so  let  M model  X by 
strong  compactness.  Now  define  U by: 

XeU  iff  XcSAM\=ceX. 

It  is  simple  to  check  that  U is  an  ultrafilter  over  S extending  F,  and  X has  the 
Lkk  sentences  which  ensure  that  U is  /c-complete. 

For  the  converse,  note  first  that  /c  must  be  regular:  If  to  the  contrary  /c  were 
singular,  then  /c-completeness  would  readily  imply  /^-completeness  for  filters. 
But  then,  if  U is  any  /c-complete  ultrafilter  over  /c+  extending  the  /c-complete 
(even  /c+-complete)  filter  {X  C /c+  | |/c+  — X\  < /c+],  then  U would  also  be 
/c+-complete  and  so  /c+  would  be  measurable,  contradicting  Ulam’s  2.4. 

Suppose  now  that  X — {cra  \ a < A.}  is  a /c-satisfiable  collection  of  LKK 
sentences.  Recall  that  VKX  = (x  c 1 |x|  < /c}.  For  any  x e VKX,  let  Mx  be  a 
structure  for  the  language  of  X so  that  Mx  1=  /\aex  cr„.  With  the  availability  of 
Mx  we  can  assume  that  X > k.  As 

{{x  e VkX  | y c x}  | y e PKX] 

generates  a /c-complete  filter  over  VKX  by  the  regularity  of  /c,  let  U be  a /c- 
complete  ultrafilter  over  VKX  extending  this  filter.  Consider  the  ultraproduct  M — 
Ur  x-M-x/U-  It  is  straightforward  to  check  that,  essentially  by  the  same  proof 
as  for  Lox„,  Los’s  Theorem  0.6  holds  for  LKK  and  ultraproducts  by  /c-complete 
ultrafilters.  Since  for  any  a < X, 


38 


Chapter  1 . Beginnings 


{.X  G T> fX  | . V! , |=  (Ta]  ^ {x  G P^X  | O'  G x}  G £/  , 
it  follows  that  M |=  cra. 

An  analogous  characterization  exists  for  weak  compactness  in  terms  of  an 
appropriately  curtailed  filter  extension  property  (Keisler-Tarski  [64: 288]). 

The  next  two  results  were  first  observed  in  equivalent  formulations. 

4.2  Corollary  (Erdos-Tarski  [43:328]).  If  k is  strongly  compact,  then  k is  mea- 
surable. 

Proof,  k is  regular  by  the  proof  of  4.1.  The  filter  {X  C k | |at  — X\  < k] 
is  consequently  /c-complete,  and  any  k -complete  ultrafilter  over  k extending  it 
verifies  the  measurability  of  k.  H 

4.3  Proposition  (Erdos-Tarski  [43:328]).  If  k is  measurable,  then  k is  weakly 
compact. 

Proof.  Suppose  that  £ be  a x-satisfiable  collection  of  LKK  sentences  using  at  most 
k non-logical  symbols.  Then  it  is  simple  to  see  that  £\  < k<k.  But  measurable 
cardinals  are  inaccessible,  so  | X|  < k.  With  (oa  \ a < k)  enumerating  £ we  can 
now  proceed  as  for  4. 1 : 

For  each  f < k let  Mp  1=  /\a<p(Ja-  Let  U be  any  k -complete  ultrafilter 
over  k,  and  set  M — nk  M-p/U.  Then  for  any  a < k, 

{f  < k I Mp  N cra}  2 {yS  < k | ft  > a]  G U , 

and  so  M 1=  er„.  H 

Finally,  weak  compactness  entails  inaccessibility: 

4.4  Proposition.  If  k is  weakly  compact,  then  k is  inaccessible. 

Proof.  To  show  that  k is  regular,  assume  to  the  contrary  that  X C k is  unbounded 
yet  |X|  < k.  Then  for  distinct  constants  c and  ca  for  a < k, 

{c=fca  |ff<((}U  {V/5exVa</Sc  = cff} 

is  x-satisfiable  yet  not  satisfiable  - a contradiction. 

To  establish  that  k is  a strong  limit,  assume  to  the  contrary  that  there  is  a 

X < k such  that  21  > x.  Then  for  distinct  constants  ca  and  d'a  for  a < X and 

i < 2, 

{Ac<J(c«  = 4 v cff  = 4)  a d°a  f 4]}  U {V0<i(c„  # 4^)  \ f ex2}  . 

is  not  satisfiable,  else  any  interpretation  of  (c«  | a < X)  would  correspond  to  a 

function:  X ->  2 different  from  every  member  of  x2,  yet  by  similar  reasoning  any 
proper  subset  is  satisfiable  - a contradiction.  H 
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Presumably  with  the  old  question  of  whether  the  least  measurable  cardinal 
is  strictly  larger  than  the  least  inaccessible  cardinal  in  mind,  Tarski  suggested 
to  his  student  William  Hanf  at  Berkeley  that  he  investigate  the  possible  weak 
compactness  of  inaccessible  cardinals.  Using  appropriate  LKK  sentences  Hanf  [64] 
in  1960  was  able  to  establish  that  there  are  in  fact  many  inaccessible  cardinals 
below  a weakly  compact  cardinal;  notably,  the  early  proofs  were  directly  analogous 
to  Tarski’s  construction  of  a consistent  yet  ^-inconsistent  theory.  Thus,  the  least 
measurable  cardinal  is,  a fortiori,  strictly  larger  than  the  least  inaccessible  cardinal 
(Tarski  [62]). 

This  Hanf-Tarski  breakthrough  was  the  first  result  about  the  size  of  measur- 
able cardinals  since  Ulam’s  original  paper.  It  was  greeted  by  Abraham  Robinson 
[62: 78]  as  “a  spectacular  success”  for  metamathematical  methods.  Tarski  himself 
opined  ([62: 125])  that  it  was  “contrary  to  expectations”;  presumably,  inaccessibil- 
ity should  have  sufficed  to  generalize  the  compactness  property  of  Lam.  Several 
times  in  the  development  of  set  theory,  such  key  results  have  reoriented  the  col- 
lective set-theoretic  intuition  and  spurred  a spate  of  new  research.  Hanf  s work 
radically  altered  size  intuitions  about  problems  that  were  coming  to  be  understood 
in  terms  of  large  cardinals. 

Several  methods  are  now  known  to  establish  Hanf  s result,  two  quickly  dis- 
covered in  its  wake:  Keisler  [62]  via  ultrapowers  and  Hanf-Scott  [61]  via  inde- 
scribability  (§6).  We  shall  proceed  without  much  ado  from  an  equivalent  formu- 
lation of  weak  compactness.  Like  the  recursive  functions  weak  compactness  has 
many  diverse  characterizations,  which  speaks  to  the  naturalness  and  centrality  of 
the  concept.  More  will  be  encountered  in  coming  sections  as  different  themes 
are  pursued.  Most  of  the  equivalences  were  established  in  the  early  1960’s,  and 
later  summarized  in  Silver  [71: 62]  and  Devlin  [75]  - see  also  Comfort-Negrepontis 
[74: 185]  for  topological  equivalences  and  Dickmann  [75:  184]  for  model-theoretic 
equivalences. 

H.  Jerome  Keisler  established  the  following  result,  generalizing  the  use  of  the 
Compactness  Theorem  to  get  proper  extensions  of  models.  A student  of  Tarski  at 
Berkeley  and  later  a prominent  model  theorist,  Keisler  made  important  contribu- 
tions at  this  formative  stage  by  providing  useful  model-theoretic  characterizations 
as  well  as  ultraproduct  proofs  (cf.  5.6). 

4.5  Theorem  (Keisler  [62,62a]).  k is  weakly  compact  iff  k has  the  Extension 
Property:  for  any  R C VK  there  is  a transitive  set  X f VK  and  an  S C X such  that 
(Vk,€,R)^  (X,e,S). 

Remarks.  -<  denotes  the  assumed  concept  of  elementary  substructure;  it  is  re- 
viewed in  a larger  setting  at  the  beginning  of  §5.  Note  that  since  X VK  is  to  be 
transitive,  k e X. 

Proof.  Suppose  first  that  k is  weakly  compact  and  fi  c yt,  k is  inaccessible  by 
4.4,  so  that  | VK\  = k.  Using  distinct  constants  c and  x for  each  x e VK,  let  X be 
the  LKK  theory  of 
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{VK,  e,  R , x)xeyK 

together  with  the  sentences  1 c is  an  ordinal1  and  ^ a1  for  each  a < k.  Then  £ 
is  clearly  a: -satisfiable,  and  so  by  weak  compactness  it  is  satisfiable.  Of  course, 
the  Axiom  of  Extensionality  is  in  £ . Also,  well-foundedness  is  readily  seen  to  be 
expressible  already  in  L0J]  M| , and  so  £ has  a member  saying  e is  well-founded. 
Hence,  by  the  Collapsing  Lemma  0.4  £ has  a transitive  model 


( X , e,  S,x,  y)xeVx 

where  y interprets  c.  For  any  x e VK,  £ contains  the  sentence 

Vv(v  € X **  Vyex  v = y)  , 

so  by  induction  on  rank,  x e VK  implies  that  x — x.  Clearly  y is  an  ordinal  > k 
so  X ^ VK,  and  hence  the  reduct  {X,  e,  S)  satisfies  all  the  requirements. 

For  the  converse,  first  observe  that  the  Extension  Property  for  k implies  its 
inaccessibility:  Assume  first  that  there  is  a /x  < k,  a cofinal  map  F:  /x  -*  k,  and  a 
proper  extension  { VK , e,  F)  < (X,  e,  S)  where  X is  transitive.  Then  3 x(F“fi  c x) 
fails  in  the  first  structure  yet  3x(5'“/x  c x)  holds  in  the  second,  which  is  a 
contradiction.  Assume  next  that  there  is  a v < k such  that  k < 2V . Then 
2V  < | Vv+\  and  Vv+\  e VK,  and  so  a surjection  G:  Vv+\  ->  k can  be  used  to 
again  derive  a contradiction. 

Next,  note  that  for  any  inaccessible  X the  following  two  Ldwenheim-Skolem 
theorems  for  Ln  are  verifiable  by  essentially  the  same  arguments  as  for  Lmco  (0.5): 

(i)  If  a is  a satisfiable  LXx  sentence,  then  it  has  a model  of  cardinality 
less  than  X . 

(ii)  If  £ is  a satisfiable  collection  of  Lu  sentences  of  cardinality  at 
most  X , then  it  has  a model  of  cardinality  at  most  X . 

Suppose  now  that  £ is  a k -satisfiable  collection  of  LKK  sentences  using  at 
most  k non-logical  symbols.  The  corresponding  LKK  language  has  k formulas  by 
the  inaccessibility  of  k , and  so  we  can  carry  out  an  arithmetization  of  the  language 
that  codes  formulas  with  members  of  VK  and  formulate  the  satisfaction  relation  for 
set  structures  in  VK  as  a class  of  VK.  Let  R\  c VK  code  this  satisfaction  relation, 
and  construing  £ to  be  a subset  of  Vk  through  the  arithmetization  let  Ry.  k ->  £ 
be  surjective.  Then  by  (!)  above 

{VK,  e,  R\,  R2)  |=  Va(/?2  a has  a model)  . 

By  the  Extension  Property  (with  {R\,  R2)  coded  by  a single  subset  of  Vk)  let 
(VK,  e,  R] , R2)  < ( X , e,  S 1,  S2)  be  a proper  extension  where  X is  transitive.  By 
elementarily  S2“k  — R2  K ^ and  e,  Si,  S2)  \=  £ has  a model.  But  by 
the  inaccessibility  of  k,  (Vk,  e)  and  hence  (Z,  e)  models  ZFC,  and  since  X is 
transitive  and  k e X,  (X,  e)  \=  k is  inaccessible  (cf.  3.1).  It  follows  by  (ii)  above 
that 
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(X,  e,  Si,  S2>  N E has  a model  M.  with  domain  c k . 

Finally,  it  can  be  assumed  that  the  LKK  satisfaction  relation  Si  for  M is  upward 
persistent  (in  fact  absolute)  between  X and  V as  <kk  c Vk  c X:  as  in  the  Lw(0 
case  the  relation  for  M is  E\  (and  also  Il\).  Hence,  M really  models  X in  V.H 

Although  ( X , e)  |=  k is  inaccessible,  it  cannot  in  general  be  asserted  that 
(X,  e)  |=  k is  weakly  compact:  X may  not  contain  all  the  needed  models  of 
K-satisfiable  sets  of  sentences.  Otherwise,  (VK,  e)  |=  3 a(a  is  weakly  compact) 
by  elementarity,  and  hence  by  a simple  argument  there  really  would  be  a weakly 
compact  cardinal  less  than  k.  This  cannot  happen,  of  course,  if  k is  the  least 
weakly  compact  cardinal.  Large  cardinal  theory  is  replete  with  this  sort  of  near 
miss. 

Toward  Hanf  s result,  a result  about  reflecting  stationary  sets  is  derived: 

4.6  Proposition.  Suppose  that  k is  weakly  compact.  Then: 

(a)  If  A is  stationary  in  k,  then  there  is  an  inaccessible  X < k such  that  A IT  A, 
is  stationary  in  X. 

(b)  If  Aa  is  stationary  in  k for  each  a < k,  then  there  is  an  inaccessible  X < k 
such  that  Aa  fl  X is  stationary  in  X for  each  a < X. 

Proof,  (a)  Assume  to  the  contrary  that  for  each  f < k,  either  § is  not  inaccessible 
or  else  there  is  a C|  closed  unbounded  in  £ such  that  Q 0 A = 0.  Let  R : k VK 
be  defined  by:  R(f)  — Q if  £ is  inaccessible  and  = 0 otherwise.  By  4.5  (with 
(A,  R)  coded  by  a single  subset  of  VK)  let  {VK,  e,  A.  R)  -<  (X,  e,  B , S)  be  a proper 
extension  with  X transitive.  Then 

(X,  e,  B,  S)  ^ re  is  inaccessible  A 

S(k)  is  closed  unbounded  in  k a S(k)  fl  B — 0 . 

But  then,  S(k)  really  is  closed  unbounded,  and  since  B Dk  — A,  S'(Ar)nA  = 0 
contradicting  the  assumption  that  A is  stationary. 

(b)  The  argument  is  similar,  using  the  assumption  Q fl  Aa  = 0 for  some 
a < $ and  A = {(a,  fi)  | ft  e A„}  instead.  H 

4.7  Corollary  (Hanf  [64]).  If  k is  weakly  compact,  then  k is  K-Mahlo. 

Proof.  We  show  that  k is  a-Mahlo  for  a < re  by  induction  on  a.  The  basis  and 
limit  cases  are  immediate,  so  it  remains  to  argue  from  a to  a + 1 : 

Suppose  that  k is  a-Mahlo.  Then  for  /l  < a,  Ap  — {£  < k \ % is  /S-Mahlo} 
is  stationary  in  k.  Let  C be  closed  unbounded  in  k.  Then  by  4.6(b)  there  is 
an  inaccessible  X < k such  that  C fl  X and  Ap  fl  X for  /3  < a are  all  stationary 
in  X.  Hence,  X e C and  X is  a-Mahlo.  Since  this  obtains  for  any  such  C, 
[f  < k | | is  a-Mahlo}  is  stationary  in  k , i.e.  k is  (a  + l)-Mahlo.  H 

The  full  force  of  4.6(b)  leads  to  the  conclusion  that  {£  < k \ § is  f-Mahlo} 
is  stationary  in  k.  Indeed,  Hanf  [64]  extended  the  Mahlo  hierarchy  with  a diago- 
nalization  operator  related  to  4.6(b)  and  established  that  weakly  compact  cardinals 
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are  high  in  this  hierarchy.  He  wrote  ([64:313]):  “However,  it  turns  out  that  for 
each  class  defined  in  such  a constructive  way,  it  appears  almost  certain  on  the 
basis  of  “naive”  set  theory  (or  what  is  sometimes  called  “Cantor’s  absolute”)  that 
not  all  cardinals  belong  to  the  class.”  That  is,  his  arguments  in  no  way  disallow 
weakly  compact  cardinals. 

Tarski  [62: 134]  did  make  a clear  distinction  between  inaccessible  and  weakly 
compact  (“not  strongly  incompact”)  cardinals: 


For  one  thing,  the  belief  in  the  existence  of  inaccessible  cardinals  > co  (and  even  of 
arbitrarily  large  cardinals  of  this  kind)  seems  to  be  a natural  consequence  of  basic  intuitions 
underlying  the  ‘naive’  set  theory  and  referring  to  what  can  be  called  ‘Cantor’s  absolute’. 
On  the  contrary,  we  see  at  this  moment  no  cogent  intuitive  reasons  which  could  induce 
us  to  believe  in  the  existence  of  cardinals  > co  that  are  not  strongly  incompact,  or  which 
at  least  would  make  it  very  plausible  that  the  hypothesis  stating  the  existence  of  such 
cardinals  is  consistent  with  familiar  axiom  systems  of  set  theory. 


However,  in  a contrasting  passage  he  wrote  later  on  the  page: 


We  would  of  course  fully  dispose  of  all  the  problems  involved  if  we  decided  to  enrich  the 
axiom  system  of  set  theory  by  including  (so  to  speak,  on  a permanent  basis)  a statement 
which  precludes  the  existence  of  ‘very  large’  cardinals,  e.g.  by  a statement  to  the  effect 
that  every  cardinal  > co  is  strongly  incompact  [not  weakly  compact].  Such  a decision, 
however,  would  be  contrary  to  what  is  regarded  by  many  as  one  of  the  main  aims  of 
research  in  the  foundations  of  set  theory,  namely  the  axiomatization  of  increasingly  large 
segments  of  ‘Cantor’s  absolute’.  Those  who  share  this  attitude  are  always  ready  to  accept 
new  ‘construction  principles’,  new  axioms  securing  the  existence  of  new  classes  of  ‘large’ 
cardinals  (provided  they  appear  to  be  consistent  with  old  axioms),  but  are  not  prepared 
to  accept  any  axioms  precluding  the  existence  of  such  cardinals  - unless  this  is  done  on 
a strictly  temporary  basis,  for  the  restricted  purpose  of  facilitating  the  metamathematical 
discussion  of  some  axiom  systems  of  set  theory. 


Although  Tarski’s  words  are  somewhat  guarded,  the  influence  of  Cantor’s  realist 
conception,  perhaps  refined  by  Godel’s  remarks,  is  evident. 

Keisler  and  Tarski  detailed  the  results  and  interconnections  of  the  theory  to 
date  in  their  comprehensive  paper  [64].  Although  they  believed  that  the  original 
metamathematical  methods  “provide  the  intuitive  and  deductively  simplest  ap- 
proach” to  the  results,  they  nevertheless  labored  with  ultrapower  proofs  to  provide 
a “purely  mathematical  treatment”.  They  continued  ([64: 226]): 


We  have  been  motivated  by  the  realization  of  the  practical  fact  that  the  knowledge  of 
metamathematics  is  not  sufficiently  widespread  and  may  be  defective  among  mathemati- 
cians who  would  otherwise  be  intensely  interested  in  the  topic  discussed,  and  to  a certain 
extent  also  by  some  (irrational)  inclination  toward  puritanism  in  methods.  As  will  be 
seen  from  the  remarks  below,  we  do  not  feel  that  we  have  been  entirely  successful  in  our 
undertaking. 


Indeed,  they  first  established  that  measurable  cardinals  are  high  in  the  Mahlo 
hierarchy  using  the  ultrapower  approach  evolving  from  Keisler  [62]  (see  5.14), 
but  only  sketched  the  sharper  results  for  weakly  compact  cardinals  because  of  the 
expository  difficulties  involved.  It  is  never  simple  to  sublimate  the  satisfaction 
relation! 
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As  large  cardinal  theory  developed,  metamathematical  methods  became  basic 
to  the  subject  in  the  interplay  of  inner  models,  forcing,  and  elementary  embed- 
dings. Symptomatic  of  this  is  how  the  details  of  structures  and  satisfaction  so 
painstakingly  set  out  in  the  early  papers  were  soon  replaced  by  casual  expositions 
in  informal  rigor.  With  this  development  the  field  became  an  advanced  and  sophis- 
ticated part  of  the  mathematical  enterprise  with  less  direct  concern  for  foundational 
issues. 

The  discussion  in  Keisler-Tarski  [64]  is  framed  around  the  classes  Co,  Cj  and 
C2  of  cardinals  that  are,  respectively,  not  weakly  compact,  not  measurable,  and 
not  strongly  compact.  To  the  set  theorist  of  today,  flipping  through  its  pages  is  a 
curious  experience,  as  he  repeatedly  encounters  theorems  which  he  himself  would 
state  in  dual  or  contrapositive  form.  The  several  theorems  with  hypotheses  of  form 
“if  C,  contains  all  the  cardinals”,  while  not  necessarily  espousing  these  hypotheses 
according  to  the  previous  Tarski  quotation,  are  at  least  suggestive  of  a uniform, 
orderly  cosmos  delimited  by  pathologies  beyond.  In  the  next  wave  of  research 
by  Gaifman,  Rowbottom,  Silver,  Solovay,  Kunen,  and  others,  such  universal  con- 
straints were  replaced  by  existential  postulations  and  positive  implications  about 
the  possible  new  richness  of  the  set-theoretic  universe.  Perhaps  it  is  a coincidence, 
but  it  was  by  this  time  that  the  terminological  move  from  the  half-empty  notion 
of  prime  ideal  to  the  half-full  dual  notion  of  ultrafilter  was  completed  in  model 
theory. 
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5.  Elementary  Embeddings 

In  the  early  1960’s  set  theory  was  veritably  transformed  by  structural  initiatives 
based  on  new  possibilities  for  constructing  well-founded  models  and  establish- 
ing relative  consistency  results.  This  of  course  was  due  largely  to  the  creation 
of  forcing  by  Cohen,  who  came  upon  a remarkably  fertile  method  for  producing 
extensions  of  models  of  set  theory.  Fine  mathematicians  like  Solovay  quickly 
perceived  the  possibilities  abounding,  and  within  a few  years  the  method  of  forc- 
ing was  systematized,  and  a cornucopia  of  relative  consistency  results  was  being 
produced  (§10). 

A seminal  result  of  Scott  provided  the  impetus  in  another  direction,  the  in- 
vestigation of  elementary  embeddings.  Scott  had  begun  to  work  in  the  1950’s, 
initially  with  Tarski  at  Berkeley,  on  a variety  of  foundational  issues.  It  is  the 
nuts  and  bolts  of  forcing  to  establish  the  relative  consistency  with  ZF  of  V f L, 
but  Scott  [61]  was  the  first  to  realize  the  possibility  by  showing:  If  there  is  a 
measurable  cardinal,  then  V f L.  The  difference  is  crucial:  instead  of  a relative 
consistency  result  he  had  established  a direct  implication  from  a prior  principle.  It 
was  in  such  ways  that  Godel’s  hopes  for  large  cardinals  were  to  be  vindicated;  his 
axiom  V = L had  imposed  a consistent  delimitation  on  ZFC,  and  developments 
from  Scott’s  result  were  to  establish  the  necessity  of  large  cardinals  for  securing 
a real  transcendence  over  L (§9). 

Scott’s  work  began  the  liberal  use  of  manipulative  proper  class  constructions 
in  set  theory  and  led  to  a structural  characterization  of  measurable  cardinals  via 
ultrapowers  and  elementary  embeddings.  Setting  the  stage  Keisler  [62]  (and  in  full 
exposition  Keisler-Tarski  [64])  had  used  ultrapowers  within  set  theory  to  provide 
an  alternate  proof  of  Hanf  s result  4.6,  and  Scott  then  exploited  the  global  approach 
of  taking  an  ultrapower  of  V itself.  As  with  inner  models  the  formalization  of 
such  class  concepts  requires  care: 

We  first  develop  the  model-theoretic  concept  of  elementary  embedding  for 
structures  with  possibly  proper  class  domains,  remembering  that  such  structures 
can  be  recast  as  single  classes  through  definability.  To  review,  for  structures 
M o = (Mo,...)  and  Mi  = (Mi,...)  for  a language  C an  injective  function 
j:  Mo  -*  Mi  is  an  elementary  embedding  of  Mo  into  Mi,  denoted  j:  Mo  < Mi, 
iff  it  satisfies  the  elementarily  schema : for  any  formula  <p( Vi, . . . , v„)  of  C and 
xi, . . . , x„  e M0, 

(*)  Mo  |=  (p[x\ xn]  iff  Mi  |=  (p[j{x\), j (xn)]  . 

If  j is  also  the  identity  map  on  Mo,  then  Mo  is  an  elementary’  substructure  of 
M\,  denoted  Mo  -<  M Most  structures  to  be  considered  will  be  for  some 
extension  of  C€,  the  language  of  set  theory.  A further  specialization  is  to  €- 
models,  structures  of  form  M — (M,  e),  usually  denoted  by  just  M.  If  Mo  and 
M\  above  are  both  of  this  form,  (*)  can  be  expressed  in  terms  of  relativized 
formulas  of  Ce: 
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(**)  <pM°[xi,  ...,xn\  iff  (pM'[j(xi),...,j(xn)]. 

Similarly,  if  t(vi, . . . , v„)  is  a definable  term  of  C€,  x\, ...  ,xn  e M0,  and 
tM°[x i, . . . , x„]  e Mo,  then 

j(tMo[xi, . . .,xn])  = tM,[j(x i), . . . , j(x, ,)]  . 

All  this  is  formalizable  for  classes  in  terms  of  the  satisfaction  relation  for  classes, 
definable  in  a theory  like  Morse-Kelley,  but  not  in  ZFC  because  of  the  Godel- 
Tarski  undefinability  of  truth.  However,  we  will  be  dealing  primarily  with  set 
structures  for  which  the  formalization  can  be  carried  out,  or  inner  models,  for 
which  a satisfactory  approach  to  formalization  is  available: 

Let  j be  as  specified  before  (*)  with  C an  extension  of  Ce.  Referring  to  the 
Levy  hierarchy  of  formulas  (§0), 

j is  a £ ,,-elementary  embedding , denoted  j:  Ado  ~<n  Adi, 
iff  (*)  holds  for  the  £„  formulas  . 

If  j is  also  the  identity  map  on  Mo,  then  Ado  is  a £n-elementary  substructure 
of  Adi,  denoted  Ado  -<„  Adi.  In  particular,  for  any  class  M,  that  M -<„  V 
(understood  as  (M,  e)  -<n  ( V , e))  is  just  the  assertion  that  the  £„  formulas  are 
absolute  for  M.  Note  that  j:  Ado  -<„  Adi  implies  that  (*)  also  holds  for  the 
77„  formulas,  and  in  fact  for  all  Boolean  combinations  of  £„  formulas.  Also, 
if  Ado  = (Mo,  e),  Adi  = (M i,e}.  Mo  c Mi,  and  both  are  transitive,  then 
Ado  ~<o  Adi  since  £q  formulas  are  absolute  for  transitive  classes. 

For  any  particular  integer  n,  the  satisfaction  relation  for  £,,  formulas  for  a 
transitive  proper  class  structure  is  formalizable  in  ZF  (§0).  Thus,  the  concept  of 
£n  -elementary  embedding  is  formalizable  in  ZF  even  for  such  classes.  Focusing 
on  inner  models,  some  simple  observations  can  be  made  using  0.2,  according  to 
which  dank(i>o)  = ui1  is  AfF  and  rV„0  = ui1  is  nfF. 

5.1  Proposition.  Suppose  that  Mo  and  M\  are  inner  models  and  j:  Mq  < \ M\. 
Then: 

(a)  For  any  ordinal  a,  j (a)  is  an  ordinal  > a. 

(b)  Suppose  that  j is  not  the  identity,  and  either  M\  C Mo  or  Mq  |=  AC. 
Then  j (5)  > S for  some  ordinal  S. 

(c)  For  any  particular  integer  n,  j:  Mq  -<„  M\. 

Proof,  (a)  Being  an  ordinal  is  £qF:  transitive  and  linearly  ordered  by  e.  That 
j (a)  > a follows  by  induction. 

(b)  Suppose  first  that  Mi  c Mo.  Let  x be  of  least  rank  such  that  j(x)  f x, 
and  set  S =rank(x).  Note  that  x c j (x),  since  y e x implies  that  y = j (y)  e j (x). 
Hence,  there  must  be  a z e j(x)  — x.  If  rank(j'(-v))  = 8,  then  since  z e Mi  c M0, 
j can  be  applied  to  get  j (z)  — z e j(x),  leading  to  the  contradictory  z e x. 
Hence,  rank(j  (x))  > 8,  but  also  rank(/'(x))  = j (8)  by  0.2. 

Suppose  next  that  Mq  |=  AC.  Assume  to  the  contrary  that  j(8 ) = 8 for  every 
ordinal  8.  Note  first  that  for  any  set  x e Mo  of  ordinals,  j (x)  is  also  a set  of 
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ordinals,  and  so  the  argument  of  the  previous  paragraph  shows  that  j(x ) = x. 
How  transitive  e-models  of  ZFC  are  determined  by  their  sets  of  ordinals  is  now 
exploited: 

Suppose  that  a £ Mq.  Let  b — tc({«}),  the  transitive  closure  of  {a}  (in  V and 
hence  also  in  Mo).  Since  Mo  |=  AC,  there  is  a bijection  e e Mq  between  some 
ordinal  y and  b.  Let  E e M0  be  the  binary  relation  on  y defined  by:  (a,  f)  £ E 
iff  e(a)  £ e(/6).  Identifying  E with  a set  of  ordinals  through  a pairing  function,  it 
follows  from  the  previous  paragraph  that  j(E)  = E.  Now  the  function  e verifies 
that  £ is  a well-founded  relation  in  V as  well  as  in  Mq.  Since  every  non-empty 
subset  of  y has  an  £-minimal  element  in  V,  this  also  holds  in  M\,  i.e.  E is 
well-founded  in  M\  as  well.  The  Collapsing  Lemma  0.4  can  now  be  applied  in 
Mj  as  (y,  E)  is  also  extensional  there,  and  by  uniqueness  the  transitive  collapse 
is  {b,  e)  £ M\.  Since  j is  Z\ -elementary,  j(E)  — E,  and  {b,  e)  is  the  unique 
transitive  collapse  of  {y,  E)  in  both  Mq  and  M\,  it  follows  that  j(b)  — b.  Using 
0.2  it  then  follows  that  j(a)  — a,  since  a is  definable  as  that  member  of  b of 
largest  rank.  But  a e Mq  was  arbitrary,  and  so  j is  the  identity  - a contradiction. 

(c)  The  proof  proceeds  by  informal  induction  up  to  any  particular  integer:  As- 
sume that  j:  Mq  -<„  Mi  with  n > 1,  and  suppose  that  (p  is  E„+ 1 , say  3vq\I/(vq , v\) 
where  f is  Eln  for  simplicity,  and  that  x £ Mq.  If  Mq  |=  <p[x],  then  Mq  |=  \f[y,  x] 
for  some  y e Mq.  Hence  Mi  |=  j (x)]  and  so  Mi  |=  cp[j  (v)]. 

For  the  converse,  suppose  that  Mi  |=  <p[j  (x)],  say  Mi  [=  \f[z,  j(x)].  By  (a) 
there  must  be  an  a such  that  z e and  by  0.2,  = j ( VaM° ) . Hence, 

Mi  ^ 3 u0  £ v2ty[j(x),  j (U„Mo)]  . 

(3do  £ D21A  is  a formula  in  two  free  variables  Di  and  V2,  interpreted  respectively  by 
j(x)  and  But  as  observed  in  §0,  3do  £ D2V''  is  still  77;^F  by  applications 

of  Replacement.  Hence, 

Mo  h 3 d0  £ V2ir[x,  V^]  , 

and  so  Mq  [=  (p[x],  H 

(c)  shows  that  for  inner  models  E\ -elementary  embeddings  provide  an  ade- 
quate formalization  in  ZF  of  the  informal  concept  of  elementary  embedding,  since 
in  any  particular  argument  only  finitely  many  instances  of  the  elementarity  schema 
are  needed.  The  argument  for  (c)  works  under  rather  general  conditions  on  ar- 
bitrary models  of  ZF  as  long  as  y“OnM°  is  cofinal  in  OnMl;  see  Gaifman  [74], 
which  provides  another  approach  to  formalization  in  many  cases. 

Henceforth,  with  (c)  in  mind  but  anticipating  more  general  situations, 

by  elemental y embedding  between  transitive  proper  class 
£-structures  is  meant  E\ -elementary  embedding  . 

Also,  to  avoid  trivialities  it  will  be  implicit  that 

elementary  embeddings  are  not  the  identity  on  their  domains  . 
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The  least  ordinal  as  in  5.1(b)  will  thus  play  a critical  role:  For  any  j:  N ->  M, 

8 = crit(  j),  the  critical  point  of  j,  iff  8 is  the  least  ordinal  moved  by  j . 

Generally,  elementary  embeddings  will  be  handled  informally,  with  remarks 
on  formalizability  in  ZFC  made  only  when  it  may  be  problematic.  Such  a circum- 
stance involves  quantification  over  proper  class  elementary  embeddings.  Flowever, 
in  each  particular  case  the  putative  quantification  will  be  one  that  can  be  recast  as 
one  over  sets.  Formalizability  is  usually  a by-product  of  the  developing  theory, 
and  elementary  embedding  formulations,  particularly  of  strong  hypotheses,  are 
often  more  perspicuous. 

Taking  the  ultrapower  of  the  universe  V by  an  ultrafilter  U over  a set  S is 
our  next  concern.  The  general  ultraproduct  construction  was  reviewed  in  §0  and 
applied  in  §4.  The  first  obstacle  in  formally  defining  an  ultrapower  of  V is  that 
for  any  /:  S — > V,  its  U equivalence  class 

(f)u  = {g\g:S-+V  a {icS  | f(i)  = g(i)}  e U] 

is  a proper  class.  Flowever,  using  a device  from  Scott  [55]  let 

(f)°u  = {g  I g e (f)u  A Vh(h  e ( f)v  -a  rank(g)  < rank(/!))}  , 

i.e.  those  members  of  (f)v  of  minimal  rank.  Then  (/)£,  is  a set,  and  so  the 
domain  of  the  ultrapower  can  be  formulated  as  the  class 

SV/U  = {{f)°v  | f.S^V) 

definable  from  U,  with  its  membership  relation  Ev  defined  as  expected: 

(f)°u  Ev  (g)^  iff  {/  e 5 | f(i)  e g(i)}  e U . 

The  ultrapower  can  then  be  defined  by: 

Ult(V,  U)  = { SV/U , Ev)  . 

More  generally,  if  N is  an  inner  model  of  ZFC  and  (N,  e,U)  \=  U is  an  ultrafilter 
over  a set  S,  an  analogous  procedure  leads  to  the  ultrapower  of  A'  by  U using 
functions  in  SN  (1  N\ 

UlfiW,  U)  = (SN/U,  Eu)  . 

It  is  not  necessary  to  assume  that  U e N. 

Continuing  in  terms  of  V,  the  needed  version  of  Los’s  fundamental  result 
can  be  established  by  the  usual  induction  on  formula  complexity: 

5.2  Theorem.  For  anv  formula  cp(v\, . . . , vn)  of  Ce  and  f\, . . . , f,  functions: 
S^V, 

Ult(V,  U)  h <p[(.fi)0u, ....  (. fn)°u ] iff  a e S I fnm  e U . H 
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As  this  depends  on  the  satisfaction  relation  for  proper  classes,  it  cannot  be 
formulated  as  a single  theorem  of  ZFC  but  only  as  a schema  of  theorems,  one  for 
each  particular  <p. 

Finally,  a strong  hypothesis  on  U leads  to  inner  models  corresponding  to 
ultrapowers: 

5.3  Proposition.  U is  a>\-complete  iff  E u is  well-founded. 

Proof  In  the  forward  direction,  if  there  were  a sequence  (ifn)%  I new)  such 
that  ( /„+i )£,  Eu  (fnflj  for  each  n e co,  then  f]n{i  e S \ f„+i(i)  e f„(i)}  / 0 
would  lead  to  an  infinite  descending  e-sequence  of  sets. 

Conversely,  if  there  were  sets  {X„  \ n e co]  c U yet  f\nea)  X„  g U,  then 
define  gk:  S — > V for  each  k e co  by: 

\n  — k if  i e (n,„<„  Xm)  - X„  and  n > k , 
gk(i ) = | 

l 0 otherwise  . 

Then  {i  e S \ gk+l(i)  e gk(i)}  2 f]m<kXm  - f^n€aXn  e U for  k e m,  and  so 
((gn)u  | n e co)  confirms  that  Ev  is  ill-founded.  H 

In  connection  with  the  Collapsing  Lemma  0.4  note  that  Eu  is  set-like:  Sup- 
pose that  (g)°,  Ev  (f)^,  and  let  g0  e (g)°u-  Define  gi:  S -»  V by 

f god)  if  god)  e fd)  , 

g id)  = 

l 0 otherwise  . 

Then  gt  e (g)^  and  rank(gi)  < rank (/).  Hence,  rank((g)^)  < rank(/)  + 1,  and 
so  {(g)?,  | ( g)°u  Ev  (/)»}  c Fra„k(/)+2  and  is  therefore  a set. 

If  U is  co\ -complete,  it  thus  follows  from  5.3  and  0.4  that  there  is  a transitive 
class  Mv  and  an  isomorphism 

ttu:  Ult(V,  U)  -»  (Mu,  e)  . 

It  is  immediate  from  Los’s  theorem  5.2  that  Mu  is  an  inner  model  of  ZFC.  Hence- 
forth to  be  used  is  the  preferred  notation 

[f]u=nu((ffu)  for  /:  5 — ► V , 

emphasizing  that  / represents  the  set  \ f]u  in  the  inner  model  Mu-  That  / need 
only  be  defined  on  a set  X C S in  U will  often  be  used  without  further  mention. 
Finally,  for  any  set  x let  fx  be  the  constant  function:  S ->  {a:}.  Then  setting 

ju (x)  = UAu  for  .ref, 

ju-  V < Mv  by  5.2,  the  usual  canonical  embedding  into  the  ultrapower  modulated 
by  its  transitive  collapse.  The  situation  will  usually  be  summed  up  by 

jv:  V < Mu  = Ult(V,  U)  . 
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More  generally,  if  N is  an  inner  model  of  ZFC,  ( N , e,  U)  |=  U is  an  ultrafilter 
over  a set  S,  and  U is  u>\ -complete  (in  V),  then  an  analogous  procedure  using 
functions  in  SN  Cl  N leads  to 


ju\  N<Ml,  = Ult (N,  U ) . 


Again,  it  is  not  necessary  to  assume  that  U e N.  Subscripts  will  usually  be 
suppressed  when  clear  from  the  context. 

With  these  preliminaries,  we  can  turn  at  last  to  measurable  cardinals: 

5.4  Proposition.  Suppose  that  U is  a K-complete  ultrafilter  over  a measurable 
cardinal  k with  corresponding  embedding  j : V -<  M = Ult(V,  U).  Then  crit(y)  = 

AT. 

Proof.  First  observe  that  j (a)  = a for  every  a < k:  If  not,  let  a < k be  the 
least  such  that  j(a)  > a.  If  [/]  = a,  then  {£  < k \ f(fi)  < a}  e U,  and  so  by 
/r -completeness  there  is  a p < a such  that  [f</c|/(f)  = /5]et/.  But  then, 
[/]  = j(fi)  — P is  a contradiction. 

Next,  since  U is  a: -complete,  it  contains  no  bounded  subsets  of  k.  So,  with 
id:  k — >-  at  the  identity  map  on  k,  for  any  a < k,  {f|a<f<A:}  e U implies 
that  a — j(a)  < [id]  < j(K).  Hence,  k < [id]  < j(K),  and  k — critf  /).  H 

Here  is  Scott’s  original  argument,  which  builds  on  the  Keisler  [62]  ultraprod- 
uct proof  of  the  Hanf-Tarski  result. 

5.5  Corollary  (Scott  [61]).  If  there  is  a measurable  cardinal,  then  V f L. 

Proof.  Building  on  5.4  suppose  that  k is  the  least  measurable  cardinal,  and  assume 
to  the  contrary  that  V — L.  Then  M C L.  so  M = L.  But  also  by  elementarily, 
(j(K)  is  the  least  measurable  cardinal)M,  contradicting  j(K)  > k.  H 

The  following  result  is  a converse  to  5.4,  and  was  first  noticed  in  a local 
form. 

5.6  Theorem  (Keisler  [62a]).  If  there  is  an  elementary  embedding  j:  V < M for 
some  inner  model  M,  then  its  critical  point  is  a measurable  cardinal. 

Proof.  Set  S = crit(y).  Then  8 > a>  since  every  ordinal  < co  is  definable  and  j is 
elementary.  Now  define  U by: 

X eU  iff  X <Z8  a 8 e j(X)  . 

The  proof  is  completed  by  showing  that  U is  a (non-principal)  5-complete  ultrafilter 
over  8.  (This  in  particular  implies  that  8 is  regular  and  hence  a cardinal.)  Note 
first  that  8 e U,  yet  a < 8 implies  that  {a}  f U since  j ({a})  = {a}.  To  check  that 
U is  an  ultrafilter  is  straightforward.  Finally,  to  verify  <5-completeness,  suppose 
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that  y < S and  X e YJJ.  Then  S e <f\{j  {X  (a))  \ a < y).  But  since  j(a)  — a 
for  a < y,  j(X)  is  a function  with  domain  y such  that  j(X)(ot ) = j(X(a))  for 
every  a < y.  Hence, 

j(f]{X(a)  | a < y})  = flU  W(a)  I « < Y)  = PK./WaO)  I a < y]  , 
and  so  f]{X(a)  | a < y]  e U.  H 

Since  the  quantifier  3 j over  classes  j cannot  be  formalized  in  ZFC,  5.6  can 
only  be  regarded  as  a schema  of  theorems,  one  for  each  j,  or  else  be  formalized 
with  V replaced  by  a Va  with  a > crit (j)  as  indeed  was  Keisler’s  approach.  In 
any  case,  measurable  cardinals  exist  exactly  when  there  are  non-trivial  elementary 
embeddings  of  (initial  segments  of)  the  universe.  Consequently,  if  the  investigation 
of  well-founded  models  of  set  theory  is  to  move  beyond  restrictions  via  inner 
models  and  extensions  via  forcing  and  to  comprehend  elementary  embeddings, 
then  measurable  cardinals  become  intrinsically  necessary. 

The  proof  of  5.6  shows  how  critical  points  of  elementary  embeddings  can  be 
used  like  principal  generators  for  ultrafilters,  which  in  turn  lead  to  ultrapowers  and 
elementary  embeddings.  Not  every  elementary  embedding  is  an  ultrapower  embed- 
ding (as  will  become  clear  in  subsequent  sections),  and  the  switch  to  ultrapowers 
has  definite  advantages  since  the  concrete  structure  yields  more  information: 

5.7  Proposition.  Suppose  that  U is  a K-complete  ultrafilter  over  k > a>  and 
j:  V < M = Ult(V,  U ) the  corresponding  embedding.  Then: 

(a)  j(x)  = x for  every  x e VK,  and  so  V.M  = VK;  j ( X ) fl  VK  = X for  every 
X C VK,  and  so  V™+x  = VK+1;  and  k+m  = k+. 

(b)  2K  < (2k)m  < jfjc ) < (2*)+. 

(c)  If  6 is  a strong  limit  cardinal  of  cofinality  fi  k,  then  j(9)  = 6. 

(d)  K M C M yet  K M g M,  i.e.  M is  closed  under  the  taking  of  arbitrary 
K-sequences,  but  not  of  arbitrary  k+ -sequences. 

(e)  U f M. 

Proof,  (a)  The  first  assertion  follows  from  5.4  and  the  rank  argument  for  5.1(b); 
the  rest  follow  in  sequence,  with  k+m  — k+  a consequence  of  M containing  every 
well-ordering  of  k. 

(b)  2K  < (2k)m  since  V(k)m  — V(k)  by  (a)  and  M c V.  ( 2K)M  < j(K ) since 
j(/c)  is  inaccessible  in  M.  Finally,  j (k)  = {[/]  | / e kk}  so  that  j(K)  < (2K)+ . 

(c)  It  suffices  to  take  6 > k,  suppose  that  [/]  < j{9),  and  then  show  that 
[/]  < 9:  We  can  assume  that  /(£)  < 9 for  every  § < k,  and  so  since  cf($)  k, 
there  is  an  a < 9 such  that  {f  < k \ f(^)<a}eU.  (If  cf(0)  < k,  this  follows 
from  k -completeness,  and  if  cf(0)  > k,  we  could  take  a = sup(ran(/)).)  Thus, 

[/]<;(«)  = {[g]  I *€'«}<« 


as  9 is  a strong  limit. 
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(d)  Suppose  that  {[/„]  | a < k]  c M;  a g:  k ->  V must  be  found  so  that 
[g]  = ([/«]  | a < k).  Let  h : k — > k be  so  that  [h]  — k.  For  each  f < k,  let 
g(f)  be  that  function  with  domain  /i(f)  satisfying  (g(f))(a)  = /„(£).  By  the 
Los  theorem  5.2,  [g]  is  a function  with  domain  [h]  — k,  and  for  each  a < k, 
[*](«)  = [/«]• 

The  second  assertion  will  be  established  by  showing  that  j“K+  f M:  j“K+ 
is  cofinal  in  j (at+),  for  if  [/]  < j (k+),  we  can  assume  that  /(f)  < k+  for  every 
f < k,  take  a — sup(ran(/))  < k+,  and  see  that  [/]  < j{a).  But  j“k+  has 
ordertype  k+ , which  by  (b)  is  less  than  j(at+),  and  so  j“t<+  e M would  contradict 
M \=  j(K+ ) is  regular. 

(e)  kk  = (‘ kk)m  e M by  (a).  If  U e M,  then  the  map  sending  / e kk 

to  [/]  would  also  be  in  M.  But  then,  j (k)  < (2K)+M  by  the  argument  for  (b), 
contradicting  the  inaccessibility  of  j (k)  in  M.  H 

The  point  of  (c)  is  that  there  is  a definable,  proper  class  of  ordinals  fixed  by 
j.  (b)  or  (e)  imply  that  M ^ V,  so  that  there  can  be  no  ultrapower  embedding: 
V < V;  In  fact,  there  can  be  no  embedding:  V < V of  any  sort  (23.12),  and  this 
was  to  be  a watershed  for  the  overall  theory.  That  V^+[  — VK+\  leads  to  a striking 
observation  about  how  the  measurability  of  k controls  the  size  of  2K : 

5.8  Corollary  (Scott).  If  k is  measurable  and  2"  = a+  for  every  a < k,  then 

2K  = K+. 

Proof.  With  j:  V -<  M = Ult(V,  U)  as  before,  k < j (k)  and  elementarity  implies 
that  (2k)m  = k+m . But  then,  2K  < (2K)M  — k+m  = k+ . H 

Similarly,  if  k is  measurable  and  2"  < a++  for  every  a < k,  then  2K  < /f++, 
and  so  forth.  Actually  getting  a measurable  cardinal  k satisfying  2K  > k+  turned 
out  to  require  strong  hypotheses,  and  the  investigation  of  this  possibility  was  to 
lead  to  the  development  of  important  new  forcing  techniques  (see  volume  II). 

The  Czech  mathematician  Petr  Vopenka  [62]  independently  derived  Scott’s 
result  5.5  from  a difference  in  cardinal  arithmetic  between  V and  M as  for  5.7(b), 
since  j (k)  is  inaccessible  in  M.  Vopenka  and  his  student  Karel  Hrbacek  then 
established  a global  generalization  of  Scott’s  result  for  the  inner  models  L(A ) 
defined  in  §3: 

5.9  Theorem  (Vopenka-Hrbacek  [66]).  If  there  is  a strongly  compact  cardinal, 
then  V f L(A)  for  any  set  A. 

Proof.  Suppose  that  k is  strongly  compact,  and  assume  to  the  contrary  that  V — 
L(A)  for  some  set  A,  which  we  can  take  to  be  transitive.  Set  X = max({/c,  |A|})+. 
By  4.1,  the  k -complete  filter  generated  by{{f|o,<f  <k]|a<A]  can  be 
extended  to  a a: -complete  ultrafilter  U over  X.  Let 

j:  V -<  M = Ult(V,  U)  with  M = {[/]  | /:  X -»  V} 


as  usual. 
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Now  let  Ult~(V,  U)  be  that  substructure  of  Ult(V,  U ) with  domain  consist- 
ing only  of  those  equivalence  classes  containing  functions  /:  X — > V such  that 
|ran(/)|  < X.  Los’s  Theorem  also  holds  for  Ult-(V,  U):  For  the  induction  step 
to  the  existential  quantifier,  note  that  if 

{«  < A,  | 3vn+i<p[fi(a), f„(a)]}  e U , 

where  each  |ran(/))|  < K then  there  are  less  than  X 77-tuples  (/i(a),  . . . , /„(«)) 
involved,  and  so  there  is  a g:  X ->  V with  |ran(g)|  < X such  that 

{a  < A | (p[fi(a), . . . , /„(a),  g(a)]}  e U . 

Ult“(V,  U)  is  well-founded,  so  let  N be  its  transitive  collapse  and 

k:  V < N = Ult“(V,  U)  . 

For  /:  X -»  V with  |ran(/)|  < X let  [/]~  denote  that  element  of  N corresponding 
to  the  equivalence  class  of  / in  Ult“(V,  U).  Next,  let  i:  N M be  defined  by 
i ([/]“)  = [/];  it  is  readily  seen  that  this  definition  does  not  depend  on  the  choice 
of  / representing  [/]“.  Then: 

(i)  i is  elementary,  and  j = i o k . 

(ii)  /(a)  = a for  every  a < k{ X),  and  i(k(A))  = k(A ) . 

(iii)  k(X)  — sup({L(a)  | a < X})  < [id]  < j (a),  where  id:  X ->  X is  the 

identity  map  on  X . 

For  (ii)  it  can  in  fact  be  proved  by  induction  on  e that  [ /]~  = [/]  for  any  / 
such  that  the  transitive  closure  of  ran (/)  has  cardinality  less  than  X.  For  (iii) 
the  equality  follows  from  the  observation  that  if  {£  < X \ g(§)  < X}  e U and 

|ran(g)|  < X,  then  {§  < X \ g(f)  < a}  e U for  some  a < X by  the  regularity  of 

X ; the  rest  follows  as  for  5.4. 

A contradiction  can  now  be  derived  as  follows:  Since  V is  the  class  L(A ) 
definable  from  A and  k is  elementary,  N is  the  class  L(k(A))N  definable  from 
k(A).  It  follows  from  3.2(a)  that  N = L{k(A))N  = L(k(A)).  Similarly,  since  i 
is  elementary  and  i(k(A ))  = k(A),  M — L(k(A))  — N.  Now  A is  a successor 
cardinal,  say  of  X0.  So  in  M,  j (a)  is  the  successor  of  j(X 0)  and  in  N,  k(X)  is 
the  successor  of  k(X 0).  But  k(X 0)  = i(k(Xo))  — j(X 0)  by  (ii),  so  M = N implies 
that  k(X)  — j(X).  This  contradicts  (iii).  H 

Normality 

Scott  devised  a further  means  of  extracting  information  from  the  ultrapower  con- 
struction, particularly  about  reflection  phenomena  at  measurable  cardinals.  The 
key  concept  that  he  isolated  has  a combinatorial  formulation  in  terms  of  filters. 
For  a filter  F over  X, 

F is  normal  iff  for  any  (Xa  \ a < X)  e XF  its  diagonal  intersection 
A0<iX0  = {£  < X | $ e na<?  Xa]  e F . 
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A filter  over  a cardinal  X contains  the  final  segments  [f  | or  < f < A}  for 
every  a < X by  a §0  convention.  By  setting  Xa  — X for  a sufficiently  large  it 
follows  that  normality  subsumes  X-completeness.  Also,  the  diagonal  intersection 
of  ({/«  £ co  | n + 1 < /«}  | n £ co)  is  empty.  Hence,  if  there  is  a normal  filter 
over  X,  then  X is  regular  and  uncountable.  For  such  X,  the  closed  unbounded  filter 
Cx  is  normal  (0.1(b)).  Oswald  Veblen  [08]  first  took  the  diagonal  intersection  of 
closed  unbounded  sets,  although  he  did  so  only  in  the  special  case  of  (Ca  \ a < X) 
where  for  each  0 < a < A.,  f e Ca  iff  f is  a limit  point  of  Cp  for  every  / < a. 
That  Fodor’s  regressive  function  lemma  0.1(c)  is  just  a dual  formulation  was  soon 
realized  as  a general  and  useful  fact  about  normal  filters.  For  a filter  F over  a set 
S and  ICS,  X is  F -stationary  iff  X D Z 0 for  every  Z e F. 

5.10  Exercise.  A filter  F over  X is  normal  iff  for  any  F -stationary  X and  regressive 
/:  X — > X there  is  an  a < X such  that  /-1({a})  is  F -stationary  in  X. 

Flint.  In  the  forward  direction,  if  for  every  a < X,  y 1 ({or})  is  not  E-stationary 
so  that  X — /_1({a})  £ F,  then  X fl  Aa<).  (X  — f~l  ({«}))  = 0. 

Conversely,  if  (Xa  \ a < X)  e lF  yet  Aa<xXa  / F,  then  X — Aa<xXa  is 
E-stationary.  Define  / on  this  set  by:  /(f)  = least  a such  that  £ f Xa.  Then  / 
is  regressive,  yet  /-1({a})  n Xa  = 0 for  every  a < X.  H 

It  follows  that  any  normal  filter  over  X extends  Cx:  If  C is  closed  unbounded 
in  X,  then  /(f)  = sup(C  fl  f)  < f whenever  f f C U [0],  yet  for  each  a < 
X,  [f  | /(f)  = a)  is  a bounded  subset  of  X. 

In  the  first  result  5.4  about  measurable  cardinals  and  ultrapowers  the  identity 
map  id:  k -»  k had  been  used  as  a convenience.  Scott  saw  that  significant  results 
can  be  derived  by  making  it  a focus  of  attention: 

5.11  Exercise  (Scott  - Keisler-Tarski  [64:  244]).  For  a K-complete  ultrafilter  U 
over  k > co  the  following  are  equivalent: 

(a)  U is  normal. 

(b)  Whenever  f e kk  and  [f  < k \ /(f)  < f]  £ U,  there  is  an  a < k such 
that  [f  < k | /(f)  = a]  e U . 

(c)  [idjy  = k,  where  id:  k ->  k is  the  identity  map  on  k. 

Hint.  The  equivalence  of  (a)  and  (b)  follows  from  5.10.  H 

5.12  Exercise  (Scott  - Keisler-Tarski  [64:241]).  If  k is  measurable,  then  there  is 
a normal  ultrafilter  over  k. 

Hint.  Suppose  that  U is  any  /c-complete  ultrafilter  over  k,  and  / e kk  satisfies 
[/](/  = k.  Then 

| f-\X)eU] 


is  a normal  ultrafilter  over  k. 
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Thus,  any  /c-complete  ultrafilter  can  be  “normalized”  by  a simple  projection 
process.  The  proof  of  5.6  provided  a canonical  way  of  defining  an  ultrafilter  from 
an  elementary  embedding;  it  is  simple  to  check  that  the  ultrafilter  U defined  in  the 
proof  of  5.6  is  normal.  Normality  leads  to  simple  but  informative  properties  about 
the  corresponding  ultrapower  that  foreshadow  later  work: 

5.13  Proposition.  Suppose  that  U is  a normal  ultrafilter  over  k.  Then: 

(a)  Mu  = {jaUm  | /:  k V}. 

(b)  If  U had  been  defined  as  for  5.6  from  a j:  V -<  M with  critical  point 
k,  then  there  is  an  elementaiy  embedding  k:  Mu  < M such  that  k o jv  = j. 
Moreover,  if  j = ju  for  some  normal  ultrafilter  N over  k,  then  U = N and  so  k 
is  the  identity. 

Proof,  (a)  If  x £ Mv,  then  x — [f]u  for  some  f:ic—>V.  But  then,  by  5.11(c), 
ju(f)M  — ju(f)([id\u)  — l.f]u,  since  the  last  equality  is  just  a schizophrenic 
translation  of  [f  < k | /(f)  = /(f)}  £ U. 

(b)  Define  k:  Mv  -*  M by  k([f\u)  — j{f){tc).  Then  for  any  formula 
<p(V\ v„). 

Mu  \=  cp[[f lb, . . . , [fn]u]  iff  {f  < K | ^[/i(f), . . . , mm  e U 

iff  M\=<p[j(JX){K),...,j{fn)(K)\ 

by  the  definition  of  U.  Hence,  k is  elementary,  and  that  k o jv  = j is  clear. 

If  j — ju  where  is  a normal  ultrafilter  over  k,  then  for  any  X C k, 

XeU  iff  ice  jN(X)  iff  {f  < K | f £ X}  = X e N 

by  the  normality  of  N.  H 

(a)  asserts  that  My  is  a simple  closure  of  the  minimal  information 
{//(/)  | /:  k —>  V}  U {at},  and  (b),  that  Mv  is  minimal  in  a natural  sense 
among  inner  models  provided  by  elementary  embeddings.  From  another  van- 
tage point,  that  [id]f/  = k leads  to  rather  pathological  consequences  about 
U that  belie  the  term  normal.  For  example,  since  VK+\  C Mv  by  5.7(a), 
Mv  \=  k = [id];/  is  inaccessible.  Hence,  the  sparse  set  {a  < k \ a is  inaccessible} 
is  in  U even  though  ultrafilters  intuitively  consist  of  large  sets.  Moreover, 
as  a model  for  arguments  with  weakly  compact  cardinals  Keisler-Tarski  [64] 
showed  that  normal  ultrafilters  are  closed  under  Mahlo’s  operation  M(X ) = [a  e 
X | X fl  a is  stationary  in  a}.  A short  model-theoretic  proof  is  given  here;  the 
original  proof  was  later  adapted  to  a more  general  setting  by  Solovay  (16.8(c)). 

5.14  Proposition  (Keisler-Tarski  [64: 248]).  Suppose  that  U is  a normal  ultrafilter 
over  k.  If  S c k is  stationary  in  k,  then 

[cr  < at  | S fl  a is  stationary  in  a}  € U . 


Hence,  ifXeU,  then  M(X)  e U. 
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Proof.  By  5.7(a),  jij(S)  fl  k = S and  so  ju(S)C\K  is  stationary  in  k in  Mv.  Since 
[id]f/  = k,  it  follows  that  {a  < k \ S fl  a is  stationary  in  a}  e U.  The  second 
assertion  follows  since  normal  filters  contain  every  closed  unbounded  set,  and  so 
any  X e U is  stationary.  H 

More  can  be  said  outright  using  a further  reflection  argument: 

5.15  Proposition.  If  U is  a normal  ultrafilter  over  k,  then 

{a  < k | a is  weakly  compact}  e U . 

Proof.  It  suffices  to  show  that  k is  weakly  compact  in  Mu.  Since  weak  com- 
pactness is  characterized  by  the  Extension  Property  of  4.5,  k has  this  property, 
and  it  suffices  to  check  that  k has  this  property  in  My.  But  if  R C VK,  there 
is  a transitive  X VK  and  an  S c X such  that  (VK,e,  R)  -<  {X,  e,  S)  exactly 
when  there  is  such  an  X satisfying  |X|  = k,  by  the  Lowenheim-Skolem  Theorem 
0.5.  Hence,  there  is  such  a structure  {X,  e,  S)  coded  as  a subset  of  VK.  Also,  the 
appropriate  satisfaction  relation  can  be  coded  as  a subset  of  VK.  But  VK+\  c Mv, 
and  so  the  proof  is  complete.  H 

Hence,  the  weak  compactness  of  a measurable  cardinal  k reflects,  and  k is 
the  /rth  weakly  compact  cardinal.  This  was  first  established  by  Hanf  and  Scott 
(6.5).  As  expected,  there  is  a limit  to  this  sort  of  reflection: 

5.16  Proposition  (Solovay).  For  any  measurable  cardinal  k,  there  is  a normal 
ultrafilter  U over  k such  that 

{a  < k | a is  not  measurable}  e U . 

Proof.  Proceeding  by  induction  let  k be  measurable,  U a normal  ultrafilter  over 
k,  and  set  A — {a  < k \ a is  measurable}.  If  A £ U,  then  we  are  done. 
Otherwise,  for  each  a e A inductively  let  Ua  be  a normal  ultrafilter  over  a such 
that  A fl  a ^ Ua.  Define  W over  k by: 

X e W iff  X c.  k a {ae  A \ X<Aa  e UJ  e U . 

It  is  simple  to  check  that  IT  is  a normal  ultrafilter  over  k such  that  A <£  W.  H 

Finally,  Scott’s  original  proof  of  5.8,  implicit  in  Hanf-Scott  [61],  is  given  for 
which  a weaker  hypothesis  suffices: 

5.17  Proposition  (Scott).  If  there  is  a normal  ultrafilter  U over  k such  that 
{a  < k | 2“  = o,+}  e U , then  2K  = k+. 

Proof.  Since  [id]f/  = k,  (2k)Mu  = k+Mu , and  the  argument  can  be  completed  as 
for  5.8.  H 
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This  argument  was  first  published  in  Vopenka  [65],  which  also  noted  that  if 
{a  < k | 2“  < a++]  e U,  then  2K  < at++,  and  so  forth. 

Combinatorial  features  of  the  use  of  normality  eventually  emerged  in  gener- 
alizations, and  besides  having  combinatorial  ramifications  normality  was  to  play 
a crucial  role  in  the  structural  investigation  of  inner  models  of  large  cardinals 
(§§19,20).  All  in  all,  it  is  remarkable  that  a combinatorial  contingency  noticed  by 
Ulam  in  his  salad  days  in  Lwow  should  emerge  so  transfigured  by  the  infusion  of 
ultrapowers  and  normality. 


6.  Indescribability 
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6.  Indescribability 

The  formalization  of  reflection  properties  was  one  of  the  early  developments  in 
the  increasing  reliance  on  model-theoretic  approaches  in  set  theory.  After  Richard 
Montague  [55,61]  studied  reflection  phenomena  and  showed  that  ZFC  is  not 
finitely  axiomatizable  in  a strong  sense,  Levy  [60a]  established  their  broader  sig- 
nificance and  the  close  involvement  of  the  Mahlo  hierarchy.  Then  Hanf-Scott  [61] 
postulated  reflection  properties  directly  for  structures  (VK,  e,  R)  where  R c VK 
in  terms  of  higher-order  languages.  They  showed  that  this  generalization  in  the 
direction  of  higher  types  rather  than  infinitely  long  formulas  results  in  a schematic 
approach  to  comparing  large  cardinals  by  size,  and  moreover  provides  a nice 
characterization  of  weak  compactness.  The  theory  was  further  elaborated  by  Levy 
[71],  and  led  in  the  1970’s  to  important  analogues  in  the  theory  of  admissible  sets 
and  inductive  definitions. 

Levy’s  initial  observations  lay  the  groundwork.  The  following  lemma  illus- 
trates the  contemporary  importance  of  closed  unbounded  sets  owing  to  the  infusion 
of  model-theoretic  techniques: 

6.1  Lemma.  Suppose  that  k is  inaccessible  and  R C VK.  Then 

{a  <K  I (Va,e,RnVa)  -<  <V„e,/?>} 

is  closed  unbounded  in  k. 

Proof.  That  this  set  is  closed  is  immediate.  To  show  that  it  is  unbounded,  let  a < k 
be  arbitrary.  Define  an  < k for  n e a>  by  recursion  as  follows:  Set  ao  = a.  Given 
an  < k define  a„+i  to  be  the  least  /l  > an  such  that  whenever  y\, ...  ,yk  e Van 
and  (VK,  e,  R)  [=  3i>o<p[yi, . . . , »]  for  some  formula  <p,  there  is  an  x e Vp  such 
that  (VK,e,R)  |=  <p[x,y\, .. . ,yf\.  Since  k is  inaccessible,  \VaJ  < k and  so 
an+i  < k.  Finally,  set  a — sup({a„  | n e a>}).  Then  (Va,  e,  R D Va)  -<  { VK , e,  R) 
by  the  usual  (Tarski)  criterion  for  elementary  substructure.  H 

The  first  part  of  the  next  proposition  recalls  1.3,  and  is  really  a synoptic 
version. 

6.2  Proposition  (Levy  [60a]). 

(a)  k is  inaccessible  iff  for  any  RCV]  there  is  an  a < k such  that 
(Va,  e,  R n Va)  -<  {VK,  e,  R). 

(b)  k is  Mahlo  iff for  any  Set)  there  is  an  inaccessible  cardinal  a < k 
such  that  (Va,  e,  R IT  Va)  < { VK , e,  R). 

Proof.  The  forward  directions  are  immediate  by  6.1.  For  the  converses,  it  is 
simple  to  see  that  k > a>.  Note  that  k is  regular:  If  not,  there  would  be  a /}  < k 
and  a function  F:  f}  —*■  k with  range  unbounded  in  k.  Set  R = [/S]  U F.  By 
hypothesis  there  is  an  a < k such  that  { Va,  e,  R (T  Va)  -<  ( VK , e,  R).  Since  /S  is 


58 


Chapter  1 . Beginnings 


the  single  ordinal  in  R,  f e Va  by  elementarily.  But  this  is  a contradiction  since 
the  domain  of  F fl  Va  cannot  be  all  of  fi. 

Next,  k is  a strong  limit:  If  not,  there  would  be  a X < k such  that  2k  > k. 
Let  G:  T(\)  ->  k be  surjective  and  set  R = {A.  + 1}  U G.  By  hypothesis,  there  is 
an  a < k such  that  (' Va , e,  R fl  Va)  -<  ( VK , e,  R).  A + 1 e Va  and  so  V(X)  e Va, 
but  this  is  a contradiction  as  before. 

Finally,  for  (b)  k is  Mahlo:  If  not,  there  would  be  a C closed  unbounded 
in  k containing  no  inaccessible  cardinals.  By  the  hypothesis  from  (b)  there  is  an 
inaccessible  a < k such  that  (Va,  e,  Cfl  Va)  -<  (VK,e,C).  By  elementarily  Cfia 
is  unbounded  in  a.  But  then,  a e C,  which  is  a contradiction.  H 

The  Reflection  Principle  for  ZF  of  Montague  [61]  and  Levy  [60a]  asserts  that 
for  any  formula  <p(v i,  . . . , vn)  and  any  /3,  there  is  a limit  ordinal  a > fi  such  that 
for  any  x\, . . . , xn  e Va, 

<p[x\ ,xn]  iff  (pVa\x u...,xn\. 

This  principle  is  implicit  in  remarks  of  Godel  made  in  1946  (see  Godel  [90:  146]). 
Classes  R can  be  incorporated  through  their  definitions  to  achieve  a certain  re- 
semblance to  the  latter  part  of  6.2(a)  with  V replacing  VK,  but  there  are  crucial 
differences:  Only  one  formula  can  be  reflected,  not  all  formulas  as  with  elemen- 
tary substructures;  indeed,  the  arithmetization  of  syntax  cannot  be  completed  in 
ZF  to  find  an  a such  that  Va  ■<  V because  of  the  undefinability  of  truth.  Also, 
arbitrary  R c VK  are  allowed  in  6.2(a);  this  incorporates  a bit  of  second-order 
logic  to  attain  the  requisite  strength  (cf.  1.3  and  the  discussion  before). 

Levy  [60a:  234]  observed  that  the  Reflection  Principle  is  equivalent  to  the 
Axiom  of  Infinity  and  the  Replacement  Schema  in  the  presence  of  the  other  ZF 
axioms.  Asserting  this  principle  within  some  VK  with  arbitrary  R C VK  necessitates 
the  inaccessibility  of  k by  the  proof  of  6.2(a).  Next  consider  the  reflection  of  V to 
a Va  for  a inaccessible;  asserting  this  new  principle  within  some  VK  with  arbitrary 
R f=.  K necessitates  the  Mahloness  of  k by  the  proof  of  6.2(b),  and  so  forth.  This 
is  reflecting  reflection:  a reflection  scheme  is  first  formulated  and  then  is  itself 
reflected!  In  this  way  Levy  developed  a hierarchy  of  set  theories  that  correspond 
to  the  a -Mahlo  cardinals  and  showed  how  the  iterative  formalization  of  reflection 
illuminates  Mahlo’s  original  scheme,  formulated  half  a century  before. 

The  formulations  of  Hanf  and  Scott  using  higher-order  languages  are  the  main 
topics  of  this  section.  The  77™  and  S'f  formulas  for  m ,neco  were  described  in 
§0;  for  Q either  77'"  or  F"', 

k is  Q-indescribable  iff  for  any  R C VK  and  Q sentence 
such  that  (VK,  e,  R)  \ = cp,  there  is 
an  a < k such  that  { Va,  e,  R fl  Va)  |=  <p  . 

Over  structures  of  form  ( Va , e,  R)  with  a a limit  and  R C Va,  every  higher-order 
formula  is  equivalent  by  standard  means  to  a 77"'  or  Ff  formula  for  some  m and 
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n with  blocks  of  like  quantifiers  contracted  into  one.  Including  arbitrary  R C VK 
bolsters  the  theory  with  nice  consequences,  although  the  subject  can  be  pursued 
without  the  R’s  (cf.  6.9).  As  VK  is  closed  under  pairing,  the  definition  is  equivalent 
to  one  where  R is  replaced  by  any  finite  number  of  finitary  relations;  this  will 
often  be  invoked  in  what  follows. 

Extensions  of  the  definition  can  be  pursued  using  transfinite  types,  as  was 
done  by  Jensen  (see  Drake  [74: 284ff]).  Although  there  may  be  a traditional  bias 
for  finite  types  in  language,  several  indescribability  results  have  sweeping  proofs 
that  accommodate  stronger  modes  of  expressibility.  However,  as  for  the  Mahlo 
hierarchy  transfinite  types  may  lead  to  larger  cardinals,  but  not  to  a qualitative 
transcendence.  So,  the  following  weak  overall  notion  is  adopted,  keeping  in  mind 
that  stronger  indescribability  properties  are  often  derivable: 

k is  totally  indescribable  iff  k is  77”' -indescribable  for  every  m , n e a>  . 

Following  upon  Levy  [60a],  Bemays  [61]  postulated  the  full  second-order 
reflection  principle  for  the  universe  of  sets,  which  amounts  to  ascribing  77,J- 
indescribability  to  On  for  every  n e to.  Pursuing  his  second-order  approach 
to  set  theory  he  showed  that  his  principle  implies  the  Global  Axiom  of  Choice: 
There  is  a class  function  F such  that  for  any  non-empty  set  x,  F(x)  e x.  These 
investigations  have  been  continued  by  Rolando  Chuaqui  [78,81]. 

Turning  to  the  subject  at  hand,  some  observations  are  first  made  for  the  case 
m — 1: 

6.3  Proposition. 

(a)  For  any  n,  k is  Sf  ^-indescribable  iff  k is  Fl^-indescribable. 

(b)  k is  Xj1 -indescribable  iff  k is  inaccessible. 

Proof,  (a)  In  the  substantive  direction,  suppose  that  R C VK  and  (VK,  e,  R)  |= 
3 Xcp(X)  say,  where  X is  type  2 and  (p  is  77(J.  So,  there  is  an  S c VK  such  that 
{VK,  e,  R)  \=  (p[S~\.  But  then,  X can  be  replaced  in  tp  by  a new  predicate  symbol 
and  77([ -indescribability  applied  to  (VK,  e,  R,  S). 

(b)  This  follows  by  reducing  to  first-order  formulas  by  (a)  and  using  the 
arguments  for  6.2(a).  H 

What  about  77'  -indescribability?  Hanf  and  Scott  found  that  it  provides  a nice 
characterization: 

6.4  Theorem  (Hanf-Scott  [61]).  k is  n\ -indescribable  iff  k is  weakly  compact. 

Proof.  We  can  argue  in  terms  of  the  4.5  characterization  of  weak  compactness 
in  terms  of  the  Extension  Property.  For  this  purpose,  suppose  first  that  k is 
inaccessible  and  R c VK.  If  there  is  a transitive  X f VK  and  an  S c X such 
that  (K,  e,  R)  -<  ( X , e,  S),  then  there  is  such  an  X satisfying  |X|  = \VK\  by  the 
Lowenheim-Skolem  Theorem  0.5.  Using  an  injection  of  X into  VK,  this  in  turn 
amounts  to  asserting  that  there  is  an  A C VK  coding  a structure  (F,  E , T)  with  E 
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a binary  relation  satisfying  the  hypotheses  of  the  Collapsing  Lemma  0.4,  and  a 
proper  elementary  embedding  J:  (VK,  e,  R)  — > (Y,  E , T).  That  E is  well-founded 
is  a first-order  assertion  in  (VK,  e,  £}  saying  that  there  are  no  infinite  Zs-descending 
sequences,  since  WVK  c V*  by  the  inaccessibility  of  /c.  That  J is  elementary  is 
expressible  in  terms  of  the  satisfaction  relation,  formalizable  as  usual  through  the 
existence  of  a satisfaction  sequence  K defined  by  recursion  on  formula  complexity. 
Thus,  when  properly  formalized  the  Extension  Property  holds  for  R c VK  exactly 
when  (VK,  e,  R)  satisfies  a S\  sentence  3A3J3Kp  where  p is  first-order. 

Suppose  now  that  k is  77* -indescribable.  Then  k is  inaccessible  by  6.3. 
Assume  to  the  contrary  that  the  Extension  Property  fails  for  some  7?  c VK.  Then 
(VK,  e,  7?)  1=  a for  a 77/  sentence  a equivalent  to  the  negation  of  the  sentence 
above.  It  is  simple  to  devise  a 77/  sentence  r such  that  (Vj,  e)  |=  r iff  S is 
inaccessible.  Finally,  by  6.1, 

C — {f  < k \ {Vp,  e,  77  n Vp)  < (VK,  e,  R)} 

is  closed  unbounded  in  k,  and  so  (V*,  e,  R,  C)  |=  xf,  where  xf  is  a first-order 
sentence  asserting  that  C is  closed  unbounded. 

By  77/  -indescribability  there  is  an  a < k satisfying 

{ Va , e,  77  fl  Va,  C fl  Va)  \=  a A r A xjr 

so  that  in  particular  a is  inaccessible  and  a e C.  By  an  iterated  Skolem  hull 
argument,  there  is  an  (Xa,  €,  Sa)  -<  { VK , e,  R)  such  that:  Xa  is  transitive,  Va  is  a 
proper  subset  of  X„,  and  |X„|  = \Va\.  Having  arranged  a e C,  { Va,  e,  77 fl  Va)  -< 
{ Xa , e,  Sa).  But  this  contradicts  ( Va,  e,  7?  fl  Va)  (=  ct. 

For  the  converse,  suppose  that  k is  weakly  compact  and  (VK,  e,  77)  |=  <p 
where  <p  is  77/.  By  the  Extension  Property  there  is  a transitive  X f VK  and  an 
S c X such  that  {VK,  e,  7?)  -<  (Z,  e,  S).  In  particular,  (X,  e)  is  a transitive  model 
of  ZFC  with  k e X,  and  so  Vf  — VK  e X.  Since  the  only  type  2 quantifiers  of  <p 
are  universal  and  persist  downward  when  there  are  possibly  fewer  subsets  of  the 
domain, 

(X,e,S)  ^3a((Va,e,SnVa)  \=  <p)  , 

since  k is  such  an  a and  .S'  fl  VK  — R.  This  sentence  is  first-order,  so  also  holds 
in  (VK,  e,  R)  by  elementarily.  Since  k is  inaccessible,  this  means  that  there  really 
is  an  a < k such  that  (Va,  e,  7?  fl  Va ) [=  <p,  and  the  proof  is  complete.  H 

This  is  a subtle  phenomenon:  the  equivalence  of  an  existential  extension 
property  and  a universal  reflection  property.  Results  like  4.7  on  the  size  of  weakly 
compact  cardinals  in  terms  of  the  Mahlo  hierarchy  can  also  be  derived  via  77/  - 
indescribability. 

Finally,  Hanf  and  Scott  made  a general  statement  about  reflection  at  a mea- 
surable cardinal  that  subsumes  5.14  and  5.15: 
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6.5  Proposition  (Hanf-Scott  [61]).  Measurable  cardinals  are  772 -indescribable. 
Moreover,  if  U is  a normal  ultrafilter  over  k,  R C Vk,  and  ( VK , G,  R)  \ — (p  where 
q>  is  /7j2,  then 

{a<K  \ (Va,  G,  R n Va)  \=  <p\  G U . 

Proof.  Let  j:  V < M = Ult(V,  U).  VK+\  c M by  5.7(a),  so  that  second-order 
sentences  about  (VK,  g)  are  absolute  between  V and  M.  But  since  the  only  type  3 
quantifiers  of  <p  are  universal  and  persist  downward  when  there  are  possibly  fewer 
subsets  of  VK+\,  ({VK,  g,  R)  |=  (p)M . The  result  now  follows  by  normality  since 
j(R)nVK  = R.  H 


Kunen  (see  Solovay  [71]  or  Fremlin  [93])  established  a kind  of  /72-indescrib- 
ability  for  real-valued  measurable  cardinals,  which  for  instance  has  the  following 
application:  If  k < 2K°  is  real-valued  measurable,  then  there  is  a non-Lebesgue 
measurable  set  of  reals  of  cardinality  less  than  k.  Also,  Kunen  [71]  considered 
a weak  version  of  indescribability  that  does  not  entail  inaccessibility,  and  ob- 
tained results  below  2K°.  (The  formulation  of  Hanf-Scott  [61]  had  actually  been 
in  terms  of  the  weak  version,  with  the  inaccessibility  of  k an  ambient  hypothe- 
sis.) 

There  is  a E2  description  over  (VK,  e)  of  the  measurability  of  k,  asserting 
3U  G V2(k)  satisfying  a second-order  formula.  6.5  is  hence  the  best  possible  in 
the  sense  that  the  least  measurable  cardinal  is  not  X'2-indescribable.  However: 


6.6  Proposition  (Vaught  [63a]).  If  k is  measurable  and  U is  a normal  ultrafilter 
over  k,  then 

{a  < k | a is  totally  indescribable]  G U . 

Proof.  Let  j:  V < M = Ult(V,  U).  By  normality  it  suffices  to  show  that  M |= 
k is  totally  indescribable.  The  following  argument  imposes  little  restriction  on  the 
complexity  of  <p:  Suppose  that 


M b (R  c VK  A (K,  e,  R)  h <P)  ■ 


Then 


M \=  3a  < j(K)((Va,  G,  j(R)  n Va)  N <P)  , 


since  k is  such  an  a and  j (R)  n VK  — R.  By  elementarity, 


3a  < K({Va,  G,  R (~l  Va)  |=  (p)  , 


and  this  then  holds  in  M as  well,  since 


Vfi  = Vic. 


Hence,  there  are  many  cardinals  below  the  least  measurable  cardinal  k that 
may  have  stronger  reflection  properties  than  k itself.  On  the  other  hand,  unlike 
measurability  IT”  -indescribability  is  compatible  with  V = L.  The  argument 
appeals  to  specific  properties  of  L,  unlike  the  relativization  of  a -inaccessible  and 
a-Mahlo  cardinals  to  inner  models  (3.1): 
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6.7  Theorem.  Suppose  that  Q is  either  17 or  with  either  m > 1 or  else 
m — 1 and  n > 0,  and  k is  Q-indescribable.  Then  (k  is  (^-indescribable)1.  In 
particular,  if  k is  weakly  compact,  then  (k  is  weakly  compact )L . 

Proof.  For  only  this  proof  let  5+  denote  the  first  cardinal  greater  than  <5  and 
<5+(i)  the  ith  cardinal  greater  than  8 in  the  sense  of  L.  It  is  first  shown  that  for 
0 < i < a>  and  inaccessible  X,  to  each  77J  (resp.  U's)  formula  cp(X i,  ...,Xt) 
for  s > 0 a TTS+I  (resp.  X^+l)  formula  (p{Y\,  . . . , Y,)  can  be  associated  where 
type(Tt)  = type(Zi)  + i for  1 < k < t,  and  to  each  A e VHLxho)  for  j e co 
(allowing  V°(LX+ <o)  = Lx+m)  a set  A e Vj+,(VX)  = Vx+j+i  can  be  associated 
so  that 

(*)  Lk+<f)  \=  cp[Ai, A,]  iff  Vx  | = <p[A\ A,]. 

Although  this  will  only  be  needed  for  r = 0 and  cp  a sentence,  the  complexity 
must  be  maintained  for  the  inductive  argument. 

For  the  case  i = 1,  let  cr0  be  the  sentence  of  3.3(a)  so  that  for  any  transitive 
set  model  N of  a0,  N — La  for  some  a.  By  the  proof  of  GCH  in  L,  if  A e Lx+, 
A e Ly  for  some  y < X+.  Since  \Ly  \ < X,  this  amounts  to  the  assertion:  There  is 
a B c Vx  and  E a binary  relation  on  B satisfying  the  hypotheses  of  the  Collapsing 
Lemma  0.4  such  that  ( B , E)  \=  a o,  and  for  that  map  H inductively  defined  by 
H(x)  = {H(y)  | y E x)  as  in  that  lemma’s  proof  there  is  a z e B such  that 
H(z ) = A.  Coding  ( B , E,  z)  as  a subset  A of  Vx,  this  assertion  is  formalizable  as 
a formula  fiY)  over  Vx  with  A interpreting  type  2 variable  Y by  the  argument 
of  the  first  paragraph  of  the  proof  of  6.4. 

Proceed  now  by  recursion  to  define  cp  for  first-order  cp : For  atomic  formulas, 
if  Ai,  A2  e Lx+,  both  Aj  e A2  and  A\  — A2  can  be  rendered  as  X\  assertions 
over  Vx  about  A\  and  A2  using  the  f above.  If  cp  is  3X(pu(X,  X\, , X,)  and 
<Po(Y,  Y\, . . . , Yt)  corresponds  to  cpo,  then 

Lx+\=cp[Ax A,]  iff  Lx+  \=  ^0[A,  Au  . . . , A,]  for  some  A e Lx+ 

iff  Vx  |=  cpo[A,  A\, . . . , A,]  by  induction 
iff  Vx  |=  3F (f  A (po)[A\, . . . , Af]  . 

Flence,  < jp  can  be  taken  to  be  3 Y{\f  A ^0),  observing  that  if  cp  is  X1,0,  then  <p  is 
X1,1  by  induction.  The  sentential  connectives  are  immediate,  and  so  the  argument 
is  complete  for  first-order  cp.  For  higher-order  cp,  corresponding  sets  A can  be 
recursively  associated  with  sets  A in  a straightforward  manner.  This  completes 
the  case  / = 1. 

For  i > 1,  proceed  by  induction:  Using  the  i = 1 argument  with  Lx+o+ d 
replacing  Lx+  and  Lx+m  replacing  Vx,  first  translate  down  from  Lx+«+ d to  Lx+m; 
then  use  the  induction  hypothesis  to  translate  down  to  Vx.  This  is  where  the 
general  r is  needed.  The  translation  down  from  Lx+«+ u to  Lx+m  involves  one  new 
difficulty:  For  the  Vx  case,  the  first-order  expressibility  of  the  well-foundedness 
of  E depended  on  01  Vx  C Vx  as  in  the  cited  proof  of  6.4.  In  the  current  situation, 
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only  ®L^+(.)  fl  L C Lx+m  is  known  from  the  proof  of  GCH  in  L.  However, 
this  suffices  by  a basic  fact  about  inner  models  and  well-foundedness:  First,  for 
A e Lx+«+ 1),  A e Ly  for  some  y satisfying  \Ly\l  < X+(,\  so  that  we  can  assume 
that  the  corresponding  E e Lx+ m.  Now  if  E is  well-founded  in  L,  then  E is 
well-founded  in  V by  absoluteness  (0.3).  Hence,  it  is  enough  to  assert  that  there 
are  no  infinite  7s -descending  sequences  in  L,  and  for  this  mLXH»  fl  L c Lx+u> 
suffices. 

Proceeding  toward  the  main  argument  for  the  theorem,  note  that  if  X is  inac- 
cessible, then  it  is  inaccessible  in  L and  so  (VX)L  = Lx  by  1.2(a)  and  induction 
on  rank.  For  reflecting  down  to  such  a situation,  note  (as  for  6.4)  that  there  is  a 
n\  sentence  r such  that  Vs  |=  r iff  S is  inaccessible. 

Finally,  suppose  that  k is  77'" -indescribable  with  m,  n as  hypothesized.  (The 
X™  case  is  analogous,  with  the  special  X/  case  following  separately  from  6.3(b).) 
To  verify  77/"  -indescribability  in  L,  note  first  that  by  the  inaccessibility  of  k, 
(Vk)l  — Lk.  So  assume  that 

R e V(Lk)  n L and  «L„,  e,  R)  N n)L 

where  cpo  is  77™.  By  the  proof  of  GCH  in  L,  V(LK)  fl  L C LK+  and  inductively 
V(Lk)  fl  L C LK+m  for  each  i e a>.  Hence,  our  assertion  can  be  rendered  as 

(L  K+(m),  G,  R)  1=  <p\  . 

where  <p\  is  a 77°  sentence  corresponding  to  <p0.  By  previous  remarks,  this  in  turn 
translates  to 

{Vk,€,R) 

where  q>\  is  77™.  The  result  now  follows  by  reflecting  ^Ar  down  to  an  a < k 
and  translating  backwards  to  get 

({La,  e,  R n La)  \=  (p0)L  . H 

The  following  result  on  universal  formulas  is  a routine  application  of  the 
satisfaction  relation.  (See  Levy  [71: 208],  Drake  [74: 272],  or  Devlin  [75: 96ff]  for 
details.) 

6.8  Proposition.  For  any  in,  n > 0 there  is  a El"1  formula  \jimn(X,  Y ) with  X type 
2 and  Y type  1 such  that:  for  any  77™  formula  <p(X)  there  is  a k e co  such  that 
for  any  limit  ordinal  a and  R C Va, 

{ Va , e)  1=  (p[R ] iff  { Va , e)  \=  fmn[ R,  7']  . 

There  is  an  X™  formula  fmn(X,  Y)  with  the  analogous  property  for  X™  formulas 
<P(X).  * H 

A shift  from  a unary  predicate  to  a type  2 variable  was  made  to  get  the 
following  corollary: 
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6.9  Corollary. 

(a)  For  any  n there  is  a 77,j+1  sentence  x\ « such  that  for  any  k, 

(VK,e)  [=  xin  iff  k is  77, J -indescribable  . 

(By  6.3  this  subsumes  the  Z\  cases.) 

(b)  If  m > 1,  for  any  n > 0 there  is  a 17 f sentence  Xmn  such  that  for  any  k, 

(VK,  e)  [=  Xmn  iff  k is  -S'™ -indescribable  . 

There  is  a Z"'  sentence  that  similarly  characterizes  17™  -indescribabilitv. 

Proof,  (a)  For  n = 0,  let  /io  be  any  Tl\  description  of  inaccessibility.  For  n > 0, 
in  terms  of  f\„  from  6.8  let  x\n  be 

VXVY( Y)  ->  3 a > 0 (a  is  a limit  A (Va,  e)  |=  f\n(X  n Va , T)))  , 

appropriately  formalized  with  the  satisfaction  relation  for  sets.  Because  of  the  VX 
and  the  occurrence  of  ifi„(X,  Y)  to  the  left  of  ->  this  is  77, |+1. 

(b)  This  is  like  (a),  except  that  for  m > 1 the  VX  can  be  subsumed  in  the 
prenexing  procedure  for  classifying  the  resulting  formula.  H 

Set 

7r'"  = least  77™ -indescribable  cardinal , and 
<t™  = least  ^'"-indescribable  cardinal , 

with  the  assumption  implicit  in  the  use  of  this  notation  that  such  cardinals  exist. 
The  following  is  a consequence  6.3  and  6.9. 

6.10  Proposition  (Flanf-Scott  [61]). 

(a)  ct/  is  the  least  inaccessible  cardinal. 

(b)  jrl  = < ti\  = 0-3  < . . . 

(c)  For  m > 1 and  n > 0,  ct,™  f n"',  and  n ,™  < <rf+] , n™+  v H 

The  order  relationship  between  ct,™  and  n’f  for  m > 1 and  n > 0 is  discussed 
at  the  end  of  the  section. 

Levy  [71]  carried  out  a systematic  study  of  the  sizes  of  indescribable  cardi- 
nals, extending  aspects  of  Keisler-Tarski  [64].  Most  notably,  he  vitalized  the  idea 
implicit  in  that  paper  of  investigating  definable  filters,  an  idea  to  be  used  in  later 
contexts,  and  considerably  extended  6.10.  The  starting  point  of  his  approach  is 
that  various  large  cardinal  properties  are  not  only  attributable  to  cardinals,  but  also 
to  their  subsets.  For  X c.  k and  Q either  77,™  or  Z'f , 

X is  Q-indescribable  in  k iff  for  any  Key,  and  Q sentence  tp 

such  that  (VK,  G,  R)  |=  <p,  there  is 

an  a e X such  that  (Va,  e,  R fl  Va)  \=  (p  . 

This  leads  in  turn  to  the  consideration  of  the  collections 


6.  Indescribability 


65 


I —{X  C k | X is  not  Q-indescribable  in  /<-}  , 

{X  C k | X is  Q-indescribable  in  k}  , and 
F ={X  C K | k — X is  not  ^-indescribable  in  /c} 

of  the  negligible,  non-negligible,  and  all  but  negligible  subsets  of  k with  respect 
to  Q-indescribability.  It  is  simple  to  check  that  k is  Q-indescribable  iff  F is  a 
(proper)  filter,  the  Q-indescribable filter  over  k . ( F conforms  to  our  §0  convention 
about  containing  all  the  final  segments  {§  | a < £ < k},  since  the  assertion  in 
{ VK , e,  {a})  that  {a}  is  not  empty  cannot  be  reflected  down  to  any  V|  with  % < a. 
Levy  styled  the  members  of  F weakly  Q-enforceable  at  k,  and  did  not  refer 
explicitly  to  the  filter.)  I is  then  the  ideal  dual  to  F and  the  /-'-stationary  sets 
are  just  the  Q-indescribable  in  k sets.  The  77/ -indescribable  filter  is  also  known 
as  the  weakly  compact  filter  because  of  6.4.  In  the  use  of  this  terminology  it  is 
assumed  that  the  filters  are  indeed  proper,  i.e.  that  the  ambient  k has  the  requisite 
strength. 

These  definable  filters  have  a crucial  property: 

6.11  Proposition  (Levy  [71]).  For  m,n  > 0 the  77,'” -indescribable  and  X'f  - 
indescribable  filters  over  k are  normal. 

Proof.  Let  F be  the  77™ -indescribable  filter  over  k (the  S’f  case  is  analogous). 
Suppose  that  X c k and  /:  X — »■  k is  regressive.  Assuming  that  /_1({«1)  is 
not  77”' -indescribable  in  k for  any  a < k,  it  suffices  to  establish  that  X is  not 
77/”  -indescribable  in  k: 

Invoking  the  universal  formula  of  6.8,  it  can  be  assumed  that  for  each  a < k 
there  is  an  Ra  c VK  and  a ka  e co  such  that  (VK,  e)  |=  fm„[Ra-  &«]  yet 
(Vf , e>  1=  ^fmn[Ra  n Vj , ka]  for  any  f e X with  f(f)  = a.  Set 

R — {(a,  f)  | a < k A /3  e Ra}  , and 
T — {(a,  ka)  | « < at]  . 

Let  r be  any  first-order  sentence  such  that  if  (Vj,  e)  |=  r,  then  8 is  a non-zero 
limit  ordinal.  Then 

(VK,  e,  77,  T)  [=  r A VaVUVv(U  = 77“{a}  A v = T(a)  ->  fmn(U,  v))  . 

Properly  formalized  this  sentence  is  77"',  and  because  of  r any  £ that  reflects  it 
satisfies 

Rnv$  = {(a,  P)  I a < § A p e Ra  n , and 
T CV^  = T , 

and  so  % £ X as  / is  regressive.  Hence,  X is  not  77/'' -indescribable  in  at.  H 
Note  that  for  any  R c VK  and  77™  sentence  ip  such  that  { VK , e,  R)  |=  <p, 

{«  < k | (Va,  e,  R n Va)  1=  q>) 

is  in  the  77™ -indescribable  filter,  and  similarly  for  E'f . This  leads  to  the  following: 
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6.12  Exercise  (Levy  [71]). 

(a)  For  any  n,  { a < k \ a is  77;)-indescribable  in  a}  is  in  the  /7,J+1- 
indescribable  filter  over  k. 

(b)  For  m > 1 and  n > 0,  {a  < k \ a is  ^"'-indescribable  in  a]  is  in  the 
TI™ -indescribable  filter  over  k,  and  [a  < k \ a is  77,'" -indescribable  in  a } is  in  the 
E'f  -indescribable  filter  over  k. 

(c)  If  S is  stationary  in  k,  then  {a  < k \ S fl  a is  stationary  in  a}  is  in  the 
FI  * -indescribable  filter  over  k. 

Hint,  (a)  and  (b)  follow  from  the  sentences  specified  in  6.9,  and  (c)  follows  from 
devising  a 17}  sentence  a such  that  for  any  8 and  A C.  8 , (Vs,  €,  A)  \=  a iff  A 
is  stationary  in  8.  H 


Any  normal  filter  extends  the  closed  unbounded  filter  (5.1  Off),  so  the  sets 
exhibited  in  (a)  and  (b)  are  stationary  in  k.  Moreover,  (c)  implies  that  the  in- 
describable filters  are  closed  under  Mahlo’s  operation  and  strengthens  4.6.  6.12 
typifies  how  in  hierarchies  of  large  cardinals,  a cardinal  at  one  level  defines  for 
itself  its  transcendent  size  relative  to  cardinals  at  lower  levels. 

A fruitful  offshoot  of  the  study  of  large  cardinals  has  been  the  investigation 
of  their  various  analogues  in  restricted  contexts.  After  all,  natural  ideas  first 
germinated  in  the  maximal  setting  of  set  theory  ought  to  thrive  in  more  focused 
situations.  The  first  substantive  move  in  this  direction  was  made  in  the  early  1970’s 
in  the  theory  of  inductive  definitions  allowing  non-monotone  operators.  With  the 
admissible  ordinals  playing  the  role  of  regular  cardinals,  natural  analogues  of 
Mahlo  and  indescribable  cardinals  were  developed  in  this  context.  Wayne  Richter 
and  Peter  Aczel  in  their  [74]  and  Aczel  [77:  772ff]  provide  the  details.  A second 
wave  of  activity  began  in  1980  with  study  of  £„  admissibility  in  terms  of  reflection 
and  partition  properties  by  Evangelos  Kranakis  [82, 82a,  83].  This  was  extended 
by,  e.g.  Kranakis  and  lain  Phillips  in  their  [84],  and  expanded  to  the  study  of 
analogues  of  measurable  cardinals  by  Matthew  Kaufman  and  Kranakis  in  their 
[84]  and  by  Josephus  Baeten  [86]. 

Penetrating  work  by  Stal  Aanderaa  [74]  in  the  earlier  context  of  inductive 
definitions  motivated  Yiannis  Moschovakis  to  establish  the  following: 


6.13  Theorem  (Moschovakis  [76]).  If  V — L,  m > 1,  and  n > 0,  then  a'fi  < it'f . 

H 


Devlin  [75:  100]  sketches  a proof.  This  is  a curious  turn  of  events!  The 
use  of  effective  analogues  for  large  cardinals  provided  new  characterizations  and 
insights,  which  in  turn  reverberated  back  to  the  original  context. 

Whether  some  restrictive  hypothesis  like  V = L is  necessary  in  6.13  was  left 
unresolved  for  some  time,  until  Kai  Hauser  established  a contrasting  result  with 
sophisticated  forcing  techniques: 
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6.14  Theorem  (Hauser  [91,92]).  If  m > 1,  n > 0,  and  there  is  a Ff- 
indescribable  cardinal  and  a IT" -indescribable  cardinal  below  it,  then  there  is 
a generic  extension  with  such  cardinals  in  which  of  > nf . H 

This  was  somewhat  unexpected,  since  the  straightforward  definitions  and  the 
m — 1 precedent  suggested  an  absolute  result  in  the  Moschovakis  direction.  (The 
ordering  of  the  cardinals  in  the  hypothesis  of  the  theorem  is  necessary,  since  from 
of  > 7i f one  can  establish  the  consistency  of:  there  are  both  17,'” -indescribable 
cardinals  and  77,'" -indescribable  cardinals.) 

The  study  of  indescribability  set  the  stage  for  the  general  investigation  of 
large  cardinals  in  terms  of  reflection  phenomena,  and  also  introduced  an  important 
technique,  the  use  of  definable  normal  filters.  6.14  may  have  resolved  what  had 
been  the  major  open  problem  about  indescribability,  but  in  any  case  the  concept 
provides  a useful  framework  for  the  analysis  of  the  relative  size  of  various  large 
cardinals. 


Chapter  2 

Partition  Properties 


This  chapter  describes  the  progression  of  ideas  and  results  emerging  from  parti- 
tion properties  first  considered  by  Erdos  and  culminating  in  Silver’s  results  about 
the  existence  of  the  set  of  integers  0#,  a principle  of  transcendence  over  L.  This 
development  incorporated  both  the  refined  analysis  of  combinatorics  as  well  as 
the  full  play  of  model-theoretic  techniques,  and  provided  formulations  that  have 
come  to  be  regarded  as  basic.  §7  explores  partitions  of  n-tuples,  developing  the 
related  tree  property  and  further  characterizations  of  weak  compactness,  and  in- 
troduces partitions  of  all  finite  subsets.  §8  gives  Rowbottom’s  model-theoretic 
characterizations  and  results  about  L,  and  explores  the  related  concepts  of  Row- 
bottom  and  Jonsson  cardinals.  Finally,  §9  presents  Silver’s  definitive  work  on  sets 
of  indiscemibles  and  the  implications  for  L of  the  existence  of  0#. 
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7.  Partitions  and  Trees 

Although  an  itinerant  mathematician  for  most  of  his  life,  Paul  Erdos  has  been 
the  prominent  figure  of  a strong  Hungarian  tradition  in  the  concrete  mathematics 
of  combinatorics,  and  through  some  initial  results  he  introduced  major  initiatives 
into  the  detailed  combinatorial  study  of  the  transfinite.  Erdos  and  his  collaborators 
simply  viewed  the  transfinite  numbers  as  a combinatorially  rich  source  of  intrinsi- 
cally interesting  problems.  The  lines  of  inquiry  that  they  pursued  soon  developed 
a momentum  of  their  own,  with  the  concrete  questions  about  graphs  and  mappings 
having  a natural  appeal  through  their  immediacy.  Some  of  the  initial  speculations 
led  to  important  correlations  with  weak  compactness,  and  these  ramifications  are 
discussed  in  this  section.  Enough  was  known  by  the  late  1950’s  so  that  the  result 
that  the  least  inaccessible  cardinal  is  not  measurable  could  have  been  deduced 
before  the  Hanf-Tarski  work.  That  this  combinatorial  research  should  have  some 
bearing  on  large  cardinals  was  not  surprising,  but  that  it  was  to  play  a central  role 
in  their  structural  elaboration  in  the  1960’s  (§§8,9)  was  rather  unexpected. 

Despite  later  developments  the  story  begins  with  a problem  in  formal  logic. 
A couple  of  years  before  Gddel  established  the  Incompleteness  Theorem  [31], 
Ramsey  [30]  demonstrated  the  decidability  of  the  class  of  3V  formulas  with  iden- 
tity (see  Dreben-Goldfarb  [79]  for  a general  framework).  It  was  for  this  purpose 
that  he  proved  his  well-known  finite  combinatorial  theorem.  Up  at  Cambridge 
and  brother  to  a later  Archbishop  of  Canterbury,  Ramsey  through  his  association 
with  Bertrand  Russell  and  Ludwig  Wittgenstein  is  a pivotal  figure  in  the  philos- 
ophy of  mathematics.  With  the  latter  at  his  side,  Ramsey  died  tragically  at  the 
age  of  26.  Skolem  [33]  sharpened  Ramsey’s  work,  but  his  theorem  did  not  be- 
come widely  known  until  Erdos-Szekeres  [35]  rediscovered  it  and  applied  it  to  a 
problem  of  combinatorial  geometry.  Today,  Ramsey  Theory  is  a thriving  field  of 
combinatorics  (see  Graham-Rothschild- Spencer  [90]). 

Ramsey  also  established  an  infinite  version  of  his  combinatorial  theorem 
which  is  just  as  well-known,  and  it  was  the  investigation  of  its  analogues  in  the 
transfinite  that  led  to  large  cardinal  properties.  In  the  joint  paper  Erdos-Tarski  [43] 
weakly  inaccessible  cardinals  were  incorporated  into  a discussion  about  maximal 
mutually  disjoint  families.  But  enticingly,  that  paper  ended  with  an  intriguing 
list  of  six  combinatorial  problems  (and  announced  some  interconnections  between 
them  in  a footnote)  whose  positive  solutions  amounted  to  the  existence  of  either  a 
strongly  compact,  measurable,  or  weakly  compact  cardinal.  It  is  evident  that  here 
the  authors  were  motivated  by  strong  properties  of  co  to  formulate  direct  combina- 
torial generalizations . They  took  a distinctly  empirical  approach  to  foundational 
issues,  considering  their  problems  as  lines  of  inquiry  towards  possible  new  axioms. 
They  speculated  ([43: 328ff]): 

The  difficulties  which  we  meet  in  attempting  to  solve  the  problems  under  consideration 
do  not  seem  to  depend  essentially  on  the  nature  of  inaccessible  numbers.  In  most  cases 
the  difficulties  seem  to  arise  from  lack  of  devices  which  enable  us  to  construct  maximal 
sets  which  are  closed  under  certain  infinite  operations.  It  is  quite  possible  that  a complete 
solution  of  these  problems  would  require  new  axioms  which  would  differ  considerably  in 
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their  character  not  only  from  the  usual  axioms  of  set  theory,  but  also  from  those  hypotheses 
whose  inclusion  among  the  axioms  has  previously  been  discussed  in  the  literature  and 
mentioned  previously  in  this  paper  (e.g.,  the  existential  axioms  which  secure  the  existence 
of  inaccessible  numbers,  or  from  hypotheses  like  that  of  Cantor  which  establish  arithmetical 
relations  between  the  cardinal  numbers). 

At  first,  it  might  seem  as  if  the  possibility  of  such  prior  principles  has  not  been 
realized,  since  the  positive  solutions  to  their  problems  have  simply  been  adopted 
as  large  cardinal  hypotheses.  But,  reflection  principles  studied  in  the  1960’s  (§6) 
have  justified  “maximal  sets  which  are  closed  under  certain  infinite  operations”. 
In  particular,  the  characterization  of  weak  compactness  via  77/  -indescribability 
has  provided  a local  explanatory  principle  leading  to  the  positive  solutions  of  the 
combinatorial  problems,  and  the  second-order  indescribability  principle  of  Bernays 
[61],  essentially  the  77,/  -indescribability  of  On  for  every  n e co,  a global  one. 

The  program  initiated  by  Erdos  is  now  described,  so  that  the  related  problems 
of  Erdos-Tarski  [43]  can  be  considered  in  context.  Richard  Rado  was  Erdos’  main 
collaborator  in  this  direction  in  the  1950’s,  and  Hajnal,  since  then.  It  is  interesting 
to  note  that  like  his  earlier  compatriot  von  Neumann,  Hajnal’s  initial  work  was  in 
the  axiomatics  of  set  theory  (see  before  3.2),  but  his  concerns  were  quite  different 
afterwards.  A general  framework  called  a partition  calculus  was  developed  by 
Erdos-Rado  [56],  and  the  starting  point  is  a special  case  of  their  ordinary  partition 
symbol.  Recall  that  for  x C On,  [x]y  — [y  C x \ y has  ordertype  y}.  The 
ordinary  partition  relation 

P — ► («)£ 

asserts  that  for  any  /:  [/l]7  — > 8,  there  is  an  H e [/)]“  homogeneous  for  /: 
\f“[H]Y\  < 1.  In  other  words,  for  any  partition  of  the  ordertype  y subsets  of  P 
into  S cells  there  is  an  H C p of  ordertype  a all  of  whose  ordertype  y subsets  lie 
in  the  same  cell.  The  negation  of  this  and  like  relations  is  indicated  with  a -74- 
replacing  the  — > . The  idea  behind  this  “arrow”  notation  is  that  the  relation  is 
obviously  preserved  upon  making  the  P on  the  left  larger,  or  making  any  of  the 
a,  y,  S on  the  right  smaller,  as  long  as  the  order  relationship  between  y and  a is 
preserved,  e.g.  the  trivially  true  case  a < y can  become  false  when  y < a.  Of 
course,  8 can  be  taken  to  be  a cardinal,  and  if  a is  a cardinal,  then  the  least  p 
satisfying  the  relation  must  also  be  a cardinal.  Ramsey’s  infinitary  theorem  is  the 
assertion  co  — > (&>)]"  for  every  m , n e co  and  is  established  by  7.7. 

An  early  comment  was  that  y must  be  finite  in  the  presence  of  the  Axiom  of 
Choice: 

7.1  Proposition  (Erdos-Rado  [52:  434]).  For  any  k,  x—f  («)® . 

Proof.  Let  -<  well-order  [/c]®,  and  define  /:  [/c]®  — > 2 by  f(s)  — 0 if  every 
t e [s]®  — [s]  satisfies  s -<  t,  and  f(s)  — 1 otherwise.  Then  no  x e [/c]®  can 
be  homogeneous  for  /:  If  y is  the  -<-least  member  of  [x]®,  then  f(y)  — 0. 
But  taking  any  infinite  increasing  C-chain  xo  Cii  C x2  . . . C x of  infinite  sets, 
f(xn)  = 0 for  every  n would  imply  that  . . . x2  -<  x\  -<  xo,  contrary  to  -<  being  a 
well-ordering.  H 
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Erdos-Rado  [56]  provided  some  basic  results  for  the  general  ordinary  partition 
relation.  Working  independently  Djuro  Kurepa  [59]  also  obtained  similar  results. 
Later,  an  almost  complete  theory  for  cardinals  was  given  in  Erdos-Hajnal-Rado 
[65]  assuming  GCH.  Incorporating  the  further  work  of  Shelah  [75]  the  book  Erdos- 
Hajnal-Mate-Rado  [84]  then  extended  the  discussion  with  Byzantine  detail  to  the 
general  situation  without  GCH.  What  lies  at  the  heart  of  these  matters  is  a basic 
argument  for  producing  large  homogeneous  sets  using  trees. 

A tree  is  a partially  ordered  set  (T,  <T)  such  that  for  any  t e T the  set 
{u  e T \ u <T  t}  of  <t -predecessors  of  t is  well-ordered  by  <t-  The  ath  level 
of  T consists  of  those  t e T whose  set  of  <t -predecessors  has  ordertype  a under 
<t . The  height  of  T is  the  least  a such  that  the  ath  level  of  T is  empty.  A chain 
of  T is  a linearly  ordered  subset,  and  an  antichain  of  T is  a subset  consisting  of 
pairwise  <t -incomparable  elements.  For  example,  any  level  is  an  antichain.  A 
branch  of  T is  a maximal  chain  of  T,  a cojinal  branch  of  T is  a branch  with 
members  at  every  non-empty  level  of  T,  and  an  a-branch  of  T is  a branch  whose 
ordertype  under  <T  is  a.  A subtree  of  T is  a subset  T C T together  with  the 
induced  ordering  on  T . Although  not  always  done  in  set  theory,  it  is  assumed  that 
the  Oth  level  of  a tree  consists  of  a single  element,  called  the  root. 

Trees  abound  in  contemporary  set  theory  as  basic  combinatorial  objects.  The 
first  systematic  study  of  trees  was  carried  out  in  Kurepa’s  Paris  thesis  [35]  with 
Frechet,  where  several  tree  and  linear  order  equivalences  are  derived,  e.g.  for 
Suslin’s  Problem  [35: 127ff],  See  Levy  [79:IX§2]  for  the  basic  combinatorics, 
Devlin  [84]  for  trees  in  L,  and  Todorcevic  [84]  for  a magisterial  discussion.  The 
basic  example  is  (<yx,  c>,  the  set  <yx  — ordered  by  proper  inclusion, 

which  for  x ^ 0 is  a tree  of  height  y with  ath  level 01  x. 

The  following  lemma  provides  a basic  analysis  of  partitions  in  terms  of  trees 
and  is  known  in  general  form  as  the  Ramification  Lemma  (cf.  Erdos-Hajnal-Rado 
[65:  §6]).  It  is  used  to  establish  a close  relationship  between  large  homogeneous 
sets  for  partitions  and  long  chains  in  trees,  and  as  such  provides  the  basis  for  the 
quintessential  tree  argument  in  infinitary  combinatorics:  Instead  of  building  large 
homogeneous  sets  directly  by  recursion,  one  argues  that  because  a tree  must  have 
long  chains,  some  sufficiently  large  homogeneous  set  must  exist.  The  first  argu- 
ments of  this  sort  were  those  appealing  to  the  well-known  tree  lemma  of  Denes 
Konig  [27],  a generalization  of  which  will  be  considered  shortly.  Recall  our  §0 
convention  that  /(a i, . . . , a„)  is  written  for  /({a i, . . . , a„})  with  the  understand- 
ing that  a\  < . . . < an. 


7.2  Lemma.  Suppose  that  2 < n < co,  a is  a cardinal,  and  f : [a:]"  — > a . Then 
there  is  a tree  ( k , <f)  such  that: 

(a)  If  f <f  rj,  then  f < rj. 

(b)  If  <f  ...  <f  i </  S <f  rj,  then 


/(§!,...,  8)  = £,-1,1?). 
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(c)  For  each  a < k,  the  ath  level  of  the  tree  has  cardinality  at  most  cr'<0+cl\ 
and  is  finite  if  both  a and  a are  finite. 

Proof.  The  tree  is  defined  by  recursion,  initially  letting  the  first  n levels  be 
0</l  </...</«  — 1.  Suppose  that  n < ij  < k,  and  every  £ < tj  has  already 
been  put  on  the  tree,  i.e.  </ 1 fi  x rj)  has  been  defined.  Then  choose  a downward 
closed  chain  b of  the  tree  thus  far  constructed  maximal  with  respect  to  having  the 
property: 

for  any  £i  </...</  f„  all  in  b,  f(f,  . . . , f„)  = f{fx £„_i,  rj)  , 

and  stipulate  that  f </  i?  for  every  £ e b. 

(Actually,  this  b is  unique:  If  b\  f Z?2  were  chains  that  met  the  conditions  for 
b,  let  c — b i fl  b2,  Y l the  </-least  element  of  b\  — c,  and  Y2  the  </-least  element 
of  b2  — c,  say  with  y\  < y2.  Then  if  £ i </  ...  </  |„_i  are  all  in  c, 

/(fl,  • • • - fn-1,  yi)  = /(f 1,  • • ■ , fn-1,  b)  = /(fl,  • • • , fn-1,  K2)  • 


But  this  is  contradictory,  since  y2  had  been  put  atop  c,  yet  c did  not  meet  the 
maximality  condition  at  the  time  because  of  c U {yi}.) 

(k,  <f)  is  a tree  satisfying  (a)  and  (b).  To  complete  the  proof,  (c)  is  estab- 
lished by  induction:  At  limit  ordinals  a > 0,  first  note  that  if  b is  a maximal  chain 
through  that  part  of  the  tree  below  the  ath  level,  then  b has  at  most  one  successor 
at  the  ath  level:  If  ry  < rj2  were  two  successors,  for  any  £i  </...</  £„_i  all  in 
b,  taking  a £ e b such  that  £„_i  </  f as  a is  a limit, 


/(fl,  ■ ■ ■ , fB-i,  m)  = /(fl,  ■ • ■ , ffl-i,  f)  = /(fl,  ■ ■ ■ , fB-i,  m)  ■ 


But  this  is  contradictory,  since  rj2  had  been  put  atop  b,  yet  b was  not  maximal  at 
the  time  because  of  b U {r]\}.  Thus,  the  cardinality  of  the  ath  level  is  at  most  the 
cardinality  of  the  set  of  maximal  chains  through  that  part  of  the  tree  below  the 
ath  level.  By  induction  this  is  at  most  er1"^1  = a1"1  as  required. 

For  the  argument  at  successor  ordinals,  first  note  that  if  £ is  at  the  ath  level, 
then  § has  at  most  cr1"1”  " immediate  successors  at  the  (a  + l)st  level:  If  i]\  < rj2  are 
both  immediate  successors  of  f then  there  must  be  some  §i  </  ...  </  §„_i  </  £ 
such  that 

(*)  /(§1 §»-l,  »ll)  # /(fl f»-l.  »l: 2) 

else  r]2  would  not  have  been  an  immediate  successor,  as  before.  In  fact,  §„_i 
must  equal  § , else  (*)  would  be  an  equality  just  as  in  the  previous  paragraph. 
There  are  at  most  |a|"-2  (n  — 2)-tuples  (£i, . . . , f„_2)  as  £ is  at  the  ath  level, 
and  each  immediate  successor  ?;  of  f determines  for  each  such  (n  — 2)-tuple  a 
value  /(fi, ....  f„_2,  f,  fi)  < a.  Since  these  values  must  be  different  as  in  (*)  for 
distinct  immediate  successors,  there  must  be  at  most  ct1"1"  ' immediate  successors. 

It  is  now  simple  to  complete  the  argument:  By  induction,  the  ath  level  has 
cardinality  at  most  f‘0+a\  ■ er1^"  ‘ < erl<"+Q/  Also,  if  a and  a are  both  finite  and 
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by  induction  the  ath  level  is  finite,  then  so  is  the  (a  + l)st  level  since  a1"'"  2 is 
finite.  H 

To  illustrate  how  this  lemma  is  used,  with  it  we  establish  what  is  called  the 
Erdos-Rado  Theorem,  which  shows  in  particular  that  for  any  a,  S,  and  new  there 
is  a k such  that  k — > (a)"+l . The  case  n — 1 had  occurred  in  Erdos  [42];  the  full 
theorem  was  established  independently  by  Kurepa  [59].  For  the  statement  of  the 
theorem,  define  the  first  few  Beth  numbers  based  at  a cardinal  k by:  betho(/r)  = k 
and  beth„+i(/r)  = 2beth"(,<:)  for  n e w. 

7.3  Theorem  (Erdos-Rado  [56]).  For  any  k and  n e w, 

beth„  (k  )+  — » (K+T+1  . 

Proof.  Proceed  by  induction  on  n.  The  case  n — 0 follows  from  the  regu- 
larity of  k+.  Assume  now  that  the  assertion  is  true  for  n,  and  suppose  that 
/:  [beth„+i(/r)+]"+2  ->  k.  Let  </  be  the  corresponding  tree  ordering  on 
beth„+i(v)+  as  in  the  lemma.  Then  the  beth„(K')+th  level  of  the  tree  is  not  empty: 
Otherwise,  (c)  of  the  lemma  would  lead  to  the  contradiction 

beth„+i(/r)+  < U{kw  | a < (beth„ (at))+]  < 2beth,,w  = beth„+i(v)  . 

Let  r]  be  at  the  beth„(v)+th  level  and  consider  the  chain  C = {£  | £ </  >;}. 
By  (a)  and  (b)  of  the  lemma  the  function  g:  [C]"+1  — >■  k given  by 


g($  t,  • • • , $n+ 1)  = f($u  ■■■,  $n+ 1,8)  for  some  (any)  S eC  - (f„+i  + 1) 

is  well-defined.  But  then,  the  inductive  hypothesis  can  be  applied  to  g to  extract 
a subset  of  C of  cardinality  k+,  homogeneous  for  the  original  /.  H 

7.3  is  known  to  be  best  possible  in  the  sense  that  beth„  (/r)+  cannot  be  replaced 
by  beth„(*:)  (Erdos-Hajnal-Rado  [65]).  This  follows  from  their  “stepping-up” 
lemma  ([65:  118ff]),  starting  from  two  basic  counterexamples  for  n — 1 that  had 
been  noticed  much  earlier: 

7.4  Exercise  (Godel  - Erdos-Kakutani  [43:459]).  For  any  k, 

2*-M3)2  • 

Hint.  With  (fa  \ a < 2K)  enumerating  K2  consider  F:  [2K]2  — > k defined  by: 
F(a,  0)  = least  f such  that  fa(f)  f fp(f).  H 

7.5  Proposition  (Sierpinski  [33];  Kurepa  [41:487]).  For  any  k, 

2^— 7^>  (at  + )2  . 

Proof.  Let  < be  the  lexicographic  ordering  of  K2,  i.e.  for  / f g e K2,  f < g iff 
f(a)  < g(a)  for  the  least  a < k such  that  f(a)  / g(a).  With  (/„  | or  < 2K) 
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enumerating  K2  define  F:  [2‘]2  ->  2 by:  F{a , ft)  — 0 if  fa  < fp,  and  F(a,  ft)  — 1 
otherwise. 

Assume  to  the  contrary  that  there  is  an  H e [2K]K  homogeneous  for  F . Take 
F“[H]2  — {0};  the  other  case  is  entirely  analogous.  For  each  a e H let  d(a ) be 
the  least  ordinal  § < k such  that  for  some  y e H,  /„  ||  = fy  |£  yet  < fy  (§ ) . 

For  £ < k,  set 

Hj:  — {a  e H | d(a ) = £}  . 

Note  that  for  any  £ < k,  if  a,  P e H then  /„||  = /^ll-  Thus,  if  a e and 
y e //  is  such  that  /a|f  = /r|§  yet  /«(§)  < fy  (|),  then  any  other  member  of 
has  this  property  with  respect  to  the  same  y.  Hence,  c [a  e H \ fa  < fY}, 
and  so  |//^|  < k,  since  by  homogeneity  H is  well-ordered  by  < in  ordertype  k + . 
However,  this  contradicts  the  regularity  of  k+  since  H = H^.  H 

For  any  X,  using  7.3  there  is  a sequence  (Aa-  | k e a>)  such  that  Xo  = X 

and  for  k e co,  Xk+\  — ^ (a)"  for  any  a,  S < Xj,  and  n e w.  It  follows  that  if 

k — sup({kA  | k e co}),  then  k > X and  k — > ( a)ns  for  any  a,S  < k and  n e co. 
The  following  result  is  about  the  next  interesting  possibility,  directly  generalizing 
Ramsey’s  co  — > ( co)\ : 

7.6  Corollary  (Erdos).  If  k > co  and  k — > {k)\,  then  k is  inaccessible. 

Proof.  First,  k must  be  regular:  Otherwise,  k would  be  a disjoint  union  k — 
lj^<y  Xj:,  where  y < k and  |A^|  < k for  each  ^ < y.  Then  the  function 
/:  [/c]2  — >■  2 defined  by  /(a,  P)  = 0 if  a and  P are  in  the  same  X|,  and  /(a,  ft)  = 
1 otherwise,  cannot  have  a homogeneous  set  of  cardinality  k. 

Next,  k must  be  a strong  limit:  Otherwise,  there  would  be  a X < k such  that 

k < 2X.  But  then,  k — > (k)\  would  contradict  7.5  for  X.  H 

The  weaker  conclusion  that  k is  weakly  inaccessible  is  almost  explicit  in 
Erdos  [42]  (cf.  Erdos-Tarski  [43:328]). 

The  Tree  Property 

The  proof  of  7.3  shows  how  the  existence  of  large  homogeneous  sets  can  be  related 
to  the  existence  of  long  chains  in  trees.  For  k — > ( k )2,  the  key  feature  is  a strong 
property  considered  by  Kurepa  [35]  that  replaces  the  cardinality  argument  of  7.3: 
A K-tree  is  a tree  of  height  k each  of  whose  levels  has  cardinality  less  than  k. 

k has  the  tree  property  iff  every  /c-tree  has  a cofinal  branch  . 

This  generalizes  the  well-known  tree  lemma  of  Konig  [27],  which  is  just  the 
assertion  that  oo  has  the  tree  property.  The  following  result  will  shortly  be  seen  to 
provide  another  characterization  of  weak  compactness. 

7.7  Exercise  (Ramsey  [30]  for  k = co;  Erdos-Tarski  [43:328]  for  n = X = 2). 
If  K — co,  or  is  inaccessible  and  has  the  tree  property,  then  for  any  n e co  and 
X < k,  k — ^ fir)". 
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Hint.  Use  the  inductive  argument  for  7.3;  for  the  case  k = a>,  apply  the  last  clause 
of  7.2(c).  H 

Ramsey’s  original  proof  was  essentially  this,  although  he  did  not  formulate 
Konig’s  lemma  explicitly. 

As  mentioned  in  §4  the  details  of  implications  announced  at  the  end  of  Erdos- 
Tarski  [43]  were  provided  in  a seminar  at  Berkeley  in  1958-9  by  Mostowski  and 
Tarski,  and  appeared  in  Erdos-Tarski  [61].  In  addition  to  the  crucial  Hanf  (§4) 
and  Scott  (§5)  results  that  followed,  the  tree  and  partition  equivalences  for  weak 
compactness  were  soon  worked  out: 

7.8  Theorem  (Erdos-Tarski  [43,61],  Hanf  [64a],  and  Monk-Scott  [64]).  The 
following  are  equivalent  for  k > co: 

(a)  k is  weakly  compact. 

(b)  k is  inaccessible  and  has  the  tree  property. 

(c)  k — > (K)"for  every  n < a>  and  X < k. 

(d)  k — > (k)\. 

Proof,  (a)  ->  (b).  Inaccessibility  is  a consequence  of  weak  compactness.  To 
establish  the  tree  property  a typical  compactness  argument  can  be  used:  Let 
(T,  <T)  be  a /c-tree.  To  each  t e T associate  a propositional  (O-ary  predicate) 
symbol  P, , and  consider  the  collection  of  LKW  sentences  consisting  of:  disjunc- 
tions \/{Pt  | t is  at  the  ath  level  of  T } for  a < k,  and  —•(P,  A Pti)  for  <7- 
incomparable  t,  t'  e T . Since  T has  height  k,  this  collection  of  k sentences  is 
/c-satisfiable.  Hence  by  weak  compactness,  it  is  satisfiable,  say  by  a model  M. 
[r  e T | M f=  P,}  is  then  a cofinal  branch  through  T . 

(b)  — > (c)  follows  from  7.7,  and  (c)  ->  (d)  is  immediate.  To  complete  the 
proof,  it  will  be  convenient  to  establish  (d)  ->  (b)  -*  (a). 

(d)  — * (b).  Inaccessibility  follows  from  7.6.  To  establish  the  tree  property, 
let  ( T , <T)  be  a a: -tree.  Since  |7j  = k,  it  can  be  assumed  that  T = k.  Iff  < k is 
at  a level  > «,  let  :r„(f)  be  its  predecessor  at  level  a (allowing  7ta(f)  — f iff  is 
at  the  ath  level).  <t  can  be  extended  to  a linear  ordering  of  k by  specifying 
how  it  orders  two  <r -incomparable  elements  f , r/  as  follows:  Let  a be  the  least 
such  that  7r„(f ) f JTa(q),  and  define  f -<  q iff  na(f)  < 

Now  define  /:  [/c]2  —>  2 by  /( f , q)  — 0 if  f -<  q,  and  /( f , q)  = 1 otherwise. 
By  k — > (k)\  let  H e [k]k  be  homogeneous  for  /.  Since  every  level  of  the 
tree  has  cardinality  less  than  k,  there  is  a pa  < k such  that  if  pa  < f and  f e H 
then  f is  at  a level  > a.  By  definition  of  -<  if  two  ordinals  y ~<  S are  both  at 
levels  > a,  then  rta(y)  -<  7Ta(S)  or  7Ta(y)  — tTq, (5).  Hence,  if  /“[H]2  = [0],  then 
(7r„(f ) | pa  < f A f e H)  is  non— < -decreasing  in  f , and  if  f“[H]2  — [1],  then  it 
is  non— < -increasing.  In  either  case  there  is  a au  < k and  a /;„  such  that  if  < f 
and  f e H,  then  7ra(f)  = ba.  Clearly,  (ba  \ a < k)  is  a cofinal  branch  since  any 
two  bf  s have  a common  <T -successor. 

(b)  -»  (a).  The  Extension  Property  (4.5)  for  weak  compactness  will  be 
verified.  So,  suppose  that  R c V);  a transitive  set  X f VK  and  anScx  must 
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be  found  so  that  (VK,  e,  R)  < ( X , e,  S).  By  the  inaccessibility  of  k and  6.1, 

{a  <k  | (Va,e,RnVa)  -<  (VK,e,R)} 

is  closed  unbounded  in  k;  let  (a%  | £ < k)  be  its  increasing  enumeration. 

A tree  (T,  <t)  is  now  defined.  Fix  a complete  set  of  Skolem  functions  (§0) 
for  (VK,  e,  R),  and  define  Skolem  hulls  for  any  X c VK  in  what  follows  as  being 
with  respect  to  this  particular  set.  For  otf  < ft  < k let 

H(f,  yd ) = the  Skolem  hull  of  Va(  U {yS}  in  (V*,  e,  R) 

so  that 

{Vav  €,Rn  y0{>  -<  . 

Define 

//(§,  yd)  & H(£ , P)  iff  ^ ^ and  there  is  an  isomorphism  between  the 

two  structures  fixing  and  sending  yd  to  yd  . 

Clearly,  ~ is  an  equivalence  relation,  and  so  let  [//(§,  yd)]  denote  the  equivalence 
class  of  //(§ , yd).  The  elements  of  our  tree  T are  to  be  the  [//(§,  yd)]’s.  Finally, 
set 

[H(Z,  yd)]  <T  [H( f,  yd)]  iff  ^ < f,  yd  < /},  and  yd)  is  isomorphic 

to  the  Skolem  hull  of  U ]yd]  in  //(f , yd)  . 

That  (T,  <7-)  is  indeed  a tree  is  not  difficult  to  see;  note  that  the  §th  level  of  T is 
{[H (f,  yd)]  | txj:  < yd  < Ar } . That  {T,  <T)  is  a /r-tree  follows  from  the  inaccessibility 
of  k,  there  being  at  most  2|lV  < k Skolem  hulls  up  to  isomorphism  generated  by 
Vai  U ]x]  for  sets  x. 

Hence,  by  the  tree  property  there  is  a k -branch  {[H(j~ , yd^)]  | £ < k)  through 
T . By  definition  of  <-/■,  whenever  § < 17  < k there  is  an  elementary  embedding 
H( yd^)  -<  H(r),  yd,)  that  fixes  Va{  so  that  = yd,.  From  the  construc- 

tion it  is  seen  that  £ < t]  < p < k implies  that  igp  = i,w  o if,.  Thus,  the  direct 
limit  (§0)  can  be  formed,  and  it  is  well-founded  as  cf (k)  > co.  The  transitive 
collapse  (0.4)  is  then  an  elementary  extension  ( X , e,  S)  of  (VK,  e,  R).  Since  the 
yd|’s  get  identified  together  to  correspond  to  an  ordinal  yd  e X such  that  yd  > k, 
(X,  e,  S)  is  as  required.  H 

The  proof  of  (a)  -»  (b)  shows  that  weak  compactness  is  equivalent  to  its 
defining  property  restricted  to  propositional  logic  for  LK(0.  Be  that  as  it  may, 
nowadays  the  roots  of  the  concept  in  infinitary  languages  are  usually  passed  over 
in  favor  of  its  more  immediate  formulations,  particularly  (b).  Just  as  Konig’s 
lemma  is  the  mathematical  essence  of  the  compactness  property  of  Lmm,  so  the 
tree  property  provides  the  gist  of  the  generalization.  Further  tree-related  equiv- 
alences for  weak  compactness  have  been  provided  by  Baumgartner  [75: 123], 
Shelah  [79],  Erdos-Hajnal-Mate-Rado  [84:  §31]  (also  due  to  Baumgartner),  and 
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Todorcevic  [87].  Harkening  back  to  the  efforts  of  Keisler-Tarski  [64]  (§4),  much 
of  the  interest  has  been  in  devising  avowedly  combinatorial  proofs  of  the  results 
on  the  size  of  weakly  compact  cardinals. 

The  tree  property,  decoupled  from  inaccessibility  and  hence  from  the  partition 
property  k — > (/c)|,  has  come  to  be  regarded  as  intrinsically  interesting  in  its 
own  right,  especially  after  the  advent  of  forcing  which  led  to  relative  consistency 
results  about  accessible  cardinals.  That  a>\  does  not  have  the  tree  property  is  a 
1934  result  of  Nathan  Aronszajn,  presented  in  Kurepa  [35: 96]  and  acknowledged 
(inaccurately)  in  Erdos-Tarski  [43:328],  A fellow  student  with  Kurepa  in  Paris 
at  the  time,  Aronszajn  may  seem  a shadowy  figure  in  the  annals  of  logic,  but  he 
became  well-known  as  a functional  analyst  (e.g.  [52]).  With  but  one  paper  [52a] 
in  set  theory  and  on  a topic  unrelated  to  trees,  it  is  a happenstance  of  mathematics 
that  his  name  has  become  commonplace  in  infinite  combinatorics: 

A K-Aronszajn  tree  is  a a: -tree  with  no  cofinal  branch  . 

Hence,  a a: -Aronszajn  tree  is  simply  a counterexample  to  the  tree  property  for  k. 
Note  that  if  k is  singular,  then  there  is  a K-Aronszajn  tree : Let  k — [0]  = Uu<y  Xa 
be  a disjoint  union  with  y < k and  |Z„|  < k for  each  a < y,  and  consider  {k,  <t) 
where  f <t  £ iff  £ < ? and  f £ £ {0}  U Xa  for  some  a < y.  For  regular  k, 
uniformly  thin  a: -trees  do  have  cofinal  branches: 

7.9  Proposition  (Kurepa  [35:  80]).  Suppose  that  k is  regular,  X < k,  and  ( T , <T) 
is  a K-tree  each  of  whose  levels  has  cardinality  less  than  X.  Then  ( T , <j)  has  a 
cofinal  branch. 

Proof.  It  can  be  assumed,  by  trimming  the  tree  if  necessary,  that  distinct  members 
at  a limit  level  have  a distinct  <r -predecessors  at  some  earlier  level.  It  can  further 
be  assumed  that  T — k.  Let  Ta  denote  the  ath  level  of  the  tree,  so  that  \T„\  < X. 
The  result  is  first  established  assuming  that  X is  regular: 

For  each  a < k with  cf(a)  = X,  choose  a f „ e Ta.  Then  for  such  a there 
is  an  f(a)  < a such  that  for  any  £ € Ta  — {£„},  £ and  £„  have  no  common 
<t -predecessor  in  Tf(a).  Since  [a  < a:  | cf(a)  = A]  is  stationary  in  k,  it  has 
(0.1(c))  a subset  X unbounded  in  k on  which  / is  constant,  say  with  value  y.  It 
follows  from  | Ty  \ < k that  there  is  a Y C X unbounded  in  k such  that  one  fixed 
member  of  Ty  is  a <T -predecessor  to  every  member  of  {^a  \ a e Y}.  But  then, 
{£„  | a e Y}  is  readily  seen  to  be  a chain  in  (k,  <t ),  and  hence  it  determines  a 
cofinal  branch. 

For  singular  X < k,  there  is  a regular  v < X so  that  Y — {a  < k \ \TU\  < u] 
is  unbounded  in  at.  By  the  previous  argument,  the  subtree  determined  by  Uaey  Ta 
has  a cofinal  branch.  But  any  such  branch  determines  a cofinal  branch  through 
the  original  tree.  H 

Specker  generalized  Aronszajn’ s original  result  under  a cardinality  assump- 
tion. A proof  based  on  the  elegant  “minimal  walk”  construction  of  Stevo 
Todorcevic  is  provided: 
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7.10  Theorem  (Aronszajn  for  k — a>  - Kurepa  [35:96];  Specker  [49]).  If  k is 
regular  and  2<K  — k,  then  there  is  a k+ -Aronszajn  tree. 

Proof  (Todorcevic  [87]).  For  each  a < k+  let  Ca  be  closed  unbounded  in  a 
of  ordertype  < k;  successor  ordinals  can  be  accommodated  by  just  requiring 
f e Cj+ 1-  Define  pp\  p ->  <0>\T(.P)~\  for  each  p < k+  as  follows:  For  any 
a < P,  let  Pq  = P and  Pf+l  = min(C^«  — a)  < P"  until  PJJ  = a;  then  set 

Pp(&)  = {Cp«  n a | i < n)  . 

Several  properties  are  noted: 

(i)  If?  < a < p < k+  , then  there  is  a unique  j e u>  such  that  p\  — p “ 
for  i < j and  | < Pj+l  < a . 

(ii)  If  £ < a < p < k+  and  pp (f)  and  pp(a)  have  the  same  length  n, 
then  for  some  j < n,  CBi  flf  is  a proper  initial  segment  of  fla  . 

(iii)  If  a < p < y and  pp(a)  — py{a),  then  pp\a  — pY\a  . 

For  (ii),  take  j as  in  (i);  then  /l|  = PJ  and  C^t  fl  § is  a proper  initial  segment 

of  C p«  fl  a as  Pj+ 1 belongs  to  the  latter  set.  For  (iii),  suppose  that  f < a.  Let  j 

be  maximal  such  that  Pj  — pj  and  y-  = yf.  Then  for  i < j,  the  consequence 
C pa  fl  a = Cya  fl  a of  pp(a)  — py{a)  implies  that  Cy  IT  £ = Cye  fl  Moreover, 

dU  i < a or  yj+1  < a by  (i).  In  either  case,  the  consequence  Cpj  IT  a = Cyj  fl  a 

of  pp(a)  = py( a)  implies  that  fij+l  — yj+x,  and  thus  that  PJ  = yf  for  i > j. 
Hence,  pp(f)  = py(f  ). 

Set 

T = {pp\a  \ a < p < k+]  . 

Then  (T,  C)  is  a tree  of  height  k+  which  is  k+ -Aronszajn:  First,  if  b were  a cofinal 
branch  through  T , then  f = [Jb  would  be  a function  with  domain  k+.  Since  the 
ordertypes  of  the  Cf  s are  at  most  k,  there  would  be  an  X c k+  with  |X|  = k+ 
and  fixed  n e co  and  ordinals  ^o, . . . , i such  that  for  any  a e X,  f(a)  has 
length  n and  its  ith  element  has  ordertype  But  then  b could  not  have  been  a 
branch  by  (ii). 

To  complete  the  proof,  note  that  for  any  a < k+  the  definition  of  the  pp’s 
and  (iii)  imply  that  the  crth  level  of  T has  cardinality  at  most 

\{Cp  fl  a | a < P < k+}|  < k<k  — k 

since  2<K  — k and  k is  regular.  H 

The  original  constructions  of  k+ -Aronszajn  trees  ensured  a further  property 
that  has  become  pivotal  in  subsequent  developments: 


A v-tree  is  special  iff  it  is  the  union  of  fewer  than  k antichains  . 
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Clearly,  a special  K-tree  is  a K-Aronszajn  tree.  The  Todorcevic  construction  also 
provides  special  K+-trees: 

7.11  Exercise.  With  the  hypotheses  of  7.10,  there  is  a special  K+-tree. 

Hint.  Modify  the  previous  construction  by  taking  instead  pp : ->  <£V  given  by 

Pp(a)  = (otlC^  Dot)  \ i < n)  , 

and  setting 

T = {pp  | (a  + 1)  | a < P < k+}  . 

By  (ii)  above  each  ~pp  is  an  injective  function,  so  that  for  s e <wk, 

As  = {f  e T | 3a,(dom(/)  = a + 1 A f(a)  = s)} 
is  an  antichain  of  {T,  C),  and  T — 1J{AS  | s e <0)k}.  H 

In  contrast,  the  first  inkling  that  cardinals  < 2K°  can  have  the  tree  property 
was  provided  by  the  following  result. 

7.12  Proposition  (Silver  [66,71]).  If  k is  real-valued  measurable,  then  k has  the 
tree  property. 

This  is  a simple  consequence  of  7.9,  as  is  later  pointed  out  in  context  (16.4(c)). 
Few  other  direct  implications  in  ZFC  about  the  tree  property  were  established  until 
the  late  1980’s.  On  the  other  hand,  consistency  results  in  the  early  1970’s  involving 
large  cardinals  have  considerably  clarified  the  possibilities  at  accessible  cardinals 
like  &>2  (see  The  Tree  Property  in  volume  II). 

Partitions  of  All  Finite  Subsets 

Erdos,  Hajnal,  and  Rado  formulated  a further  partition  relation  that  was  to  invite 
the  infusion  of  model-theoretic  techniques  more  directly  into  the  study  of  large 
cardinals.  Recall  that  for  x c On,  [x]<w  = Une® [■*]"•  The  following  partition 
relation  first  occurred  in  Erdos-Hajnal  [58:  113]: 

p — > («)r 

asserts  that  for  any  /:  [/3]<w  —>  S there  is  an  H e [/!]"  homogeneous  for  /: 
\r\Hf\  < i for  every  n e co.  In  other  words,  for  every  n.  H is  homogeneous 
for  / |[/S]"  in  the  former  sense.  For  a > co, 

the  Erdos  cardinal  k(ol)  is  the  least  X such  that  X — >•  (a)^®  . 

Implicit  in  the  use  of  this  notation  will  be  the  assumption  that  there  is  some  X 
such  that  X — > (a) J ®;  unlike  for  the  ordinary  partition  relation,  this  cannot  be 
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established  in  ZFC  (7.15(b)).  Next,  the  fixed  points  of  the  sequence  of  Erdos 
cardinals  are  specified: 


k is  Ramsey  iff  k — >•  {k)2w  . 

Thus,  Ramsey  cardinals  are  weakly  compact . Historically,  the  first  comment 
on  these  partition  relations  was  the  following,  which  showed  that  the  term  “Ram- 
sey” is  not  altogether  appropriate. 

7.13  Exercise  (Erdos-Rado  [52:435]).  u>  is  not  Ramsey. 

Hint,  f:  [&>]<<u  — »■  2 given  by  f{k\ k„)  — 0 if  k\  < n,  and  = 1 otherwise, 

has  no  infinite  homogeneous  set.  H 

The  following  proofs  are  typical  of  the  early  combinatorics: 


7.14  Proposition.  Suppose  that  a > w.  Then: 

(a)  If  a < ji,  then  K(a)  < K(f). 

(b)  K(a)  is  regular. 

(c)  k is  Ramsey  iff  for  any  y < k,  k — > (ic)Ya‘. 

Proof,  (a)  For  each  y < k (a)  let  fy:  [y]<<u  — > 2 have  no  homogeneous  set  of 
ordertype  a.  Define  g:  [ic{a)Yco  -»  2 by  g(£i,  ...,£„)  = ...,  $„-i)  (and  = 

0 for  n — 1).  If  H is  homogeneous  for  g,  then  for  y < k(oi)  with  y e If  II  D y 
is  homogeneous  for  fY  and  so  has  ordertype  < a.  Hence,  H has  ordertype  < a, 
and  so  k(<x)  < /r(yS). 

(b)  Assume  to  the  contrary  that  K(a)  is  singular.  Then  there  is  a non- 
decreasing function  h:  /r(a)  — »■  8 where  8 < k(<x)  and  |/j_1({/l)l  < K(a)  for 
each  y < 8.  Let  /:  [<5]<w  — »■  2 and  fY:  \h~x {{y})]<(°  ->  2 for  y < 8 each  have 
no  homogeneous  set  of  ordertype  a.  Define  g:  [A:(a)]<tt'  2 by 


0 if  n = 2 and  h(f\)  — h(f2)  , 

1 if  n — 2 and  h(f\)  < h(f2)  , 

fY(f  3,  • • • , %n)  if  n > 2 and  h(fx)  — . . . — h(fn)  = y , 

fm  3),  ■ • • , h(Hn))  if  n > 2 and  hfo)  <...<  h($n)  , 

0 otherwise . 


Let  H e [^(a)]"  be  homogeneous  for  g,  and  p\  and  p2  the  first  two  elements 
of  H.  Since  a > co,  H — H — {p\,  p2}  also  has  ordertype  a.  If  g“[H]2  — [0],  then 
h“H  = {/}  for  some  y < 8,  and  using  p\  and  p2  as  and  ^ 2 in  the  third  clause 
of  g it  follows  that  H is  homogeneous  for  fY,  a contradiction.  If  g“[H]2  = [1], 
then  h“H  has  ordertype  a,  and  using  p\  and  p2  as  fi  and  & in  the  fourth  clause  of 
g,  it  follows  that  /;“//  is  homogeneous  for  /,  also  a contradiction.  K(a)  is  hence 
regular. 
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(c)  Suppose  that  /:  [/c]<<u  -»  y with  y < k.  Define  g:  [/c]<<u  —>  2 
by:  g(£i,...,£„)  = 0 if  n — 2m  and  /(£  i,...,£m)  = • , h),  and 

g(t i,  ...,§„)  = 1 otherwise.  Let  //  e [a.  be  homogeneous  for  g,  and  n = 2m. 

Since  y < k,  there  must  be  s,t  e [ H ]"'  with  max(.s)  < min(r)  such  that 

f(s)  — f(t).  Thus,  g(s  U t)  — 0 and  so  by  homogeneity  g“[H]n  — {0}. 

For  any  m,  v e [//]'",  if  w e [H]'"  with  max(;^),  max(v)  < min(w),  then 
f(u)  = f (w)  — /( v).  Flence  H is  homogeneous  for  / as  well.  H 

For  cardinals  a Erdos-Flajnal-Rado  [65:  §17]  labored  with  combinatorial 
methods  to  show  that  K(a)  must  be  large,  but  far  better  results  were  soon  to 
be  achieved  using  model-theoretic  techniques.  The  following  basic  result  general- 
izing 7.14(c)  was  first  established  by  these  means;  the  proof  given  here  is  simpler 
and  based  on  an  idea  from  Baumgartner-Galvin  [78]: 


7.15  Proposition  (Silver  [66]).  Suppose  that  a > co  is  a limit  ordinal.  Then: 

(a)  For  any  y < K(a),  K(a ) — > (a)];®. 

(b)  K(a)  is  inaccessible. 

Proof,  (a)  Set  k — k(<x),  and  note  first  that  by  the  proof  of  7.14(c),  k — > 
for  any  t e co. 

Suppose  now  that  /:  [/c]<a'  — »■  y with  y < k,  and  let  g:  [y]<0>  — »■  2 have  no 
homogeneous  set  of  ordertype  a.  Define  h:  [/r]<<u  — > 4 by  setting  h(f\,  = 

0  unless  n — 2lV  for  some  i,  j > 0,  in  which  case: 


h{Sx,...,Sn) 


0 if/«i,...,&)  = /(&+!,..., hi), 

1 if /(h,...,&)>/(&+1,...,hi), 

2 if  (/ (£ik+ 1,  • • • , %uk+ 1))  I k < j)  , 

is  an  increasing  enumeration  of 
j ordinals  homogeneous  for  g , and 

3 otherwise . 


By  the  initial  remark,  there  is  an  H e [/c  ]"  homogeneous  for  h.  If  IT'\ ll]<M  — [0], 
then  H is  homogeneous  for  /,  by  the  argument  for  7.14(c).  So,  assume  to  the 
contrary  that  for  some  Ti  — 2'V , h“[H]n  f [0]  to  derive  a contradiction: 

Note  first  that  /?“[//]"  ^ [1]  also,  else  there  would  be  an  infinite  descending 
sequence  of  ordinals.  If  \ /3  < a)  is  the  increasing  enumeration  of  H,  set 
r}p  = f(X]p+v  • • ■ , fi(/>,+  Y)'i  f°r  everY  P < possible  since  a is  a limit  ordinal. 
(rjp  | P < a)  must  be  a strictly  increasing  sequence,  since  h“[H]n  f [0],  [1].  In 
particular,  for  any  natural  number  of  form  n — 2'  37  for  arbitrary  j > 0,  it  must  also 
be  the  case  that  h“[H]n  [0],  [1].  It  will  now  be  established  that  h“[H]n  = [2] 
for  every  such  n.  This  would  imply  that  {rip  \ {5  < a}  is  homogeneous  for  g,  since 
every  finite  subset  of  it  is,  contradicting  the  choice  of  g and  thereby  completing 
the  proof. 
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To  do  this  for  a given  n = 2'V  with  j > 0,  apply  Ramsey’s  Theorem  7.7 
j times  to  get  an  infinite  W c {rjp  \ < w}  homogeneous  for  every  glty]*  with 

k < j.  Let  rjpt  < . . . < rjp.  be  the  first  j elements  of  W . Then  h on  any  77-tuple 
starting  with 

^iPi+V  *’  ^”(^2+1  ’ ’ ^”((^2+1)’  1 1’  ’ 

has  value  2.  Hence,  /;“[//]"  = {2}  by  homogeneity. 

(b)  Since  /r(a)  is  regular  by  7.14(b),  it  remains  to  show  that  it  is  a strong 
limit.  But  if  A,  < k yet  2X  > k,  then  7.4  for  X would  imply  that 
contradicting  (a).  H 

In  contrast  to  7.15(b),  K(a  + n + 1)  for  a > m and  n < co  is  a cardinal 
accessible  from  K(a)\  in  the  notation  of  7.3,  K(a  + n + 1)  = beth„(/r(a))+  (S. 
Thompson  - see  Drake  [74:221]). 

The  first  inkling  that  inaccessibility  was  involved  at  all  was  a special  case 
of  7.15(b)  attributed  to  Geza  Fodor  in  Erdos-Hajnal  [58: 125].  Erdos  and  Ha- 
jnal  also  established  that  measurable  cardinals  are  Ramsey.  In  their  [62],  they 
essentially  observed  the  simple  7.14(a),  and  pointed  out  how  the  result  that  the 
least  inaccessible  cardinal  is  not  measurable  could  thus  have  been  deduced  by 
these  straightforward  means,  a couple  of  years  before  the  Hanf-Tarski  result.  This 
would  have  been  a considerable  coup  for  the  Hungarians  and  their  partition  cal- 
culus, and  one  is  left  to  speculate  on  how  it  was  missed.  Erdos  was  surely  aware 
of  the  problem  from  Erdos-Tarski  [43],  but  the  thrust  of  Erdos-Hajnal  [58]  was  in 
a different  direction,  and  the  Fodor  result  may  have  been  regarded  as  anomalous 
and  peripheral.  To  be  sure,  this  combinatorial  approach  falls  far  short  of  the  Hanf 
breakthrough,  and  does  not  even  suffice  to  establish  that  measurable  cardinals  are 
Mahlo. 

Further  showing  the  model-theoretic  approach  to  advantage  Reinhardt  and 
Silver  observed  that  even  k{co)  is  larger  than  the  least  weakly  compact  cardinal: 

7.16  Proposition  (Reinhardt-Silver  [65]).  There  is  a totally  indescribable  cardinal 
below  k(cl>). 

A proof  of  this  result  is  sketched  in  the  appropriate  setting  (9.18).  Erdos 
cardinals  have  since  been  generalized  and  shown  to  be  large  in  natural  hierarchical 
terms  (see  Subtle  Properties  in  volume  II). 

Using  normality  the  Ramsey  property  of  measurable  cardinals  was  established 
in  a strong  sense  by  Rowbottom: 

7.17  Theorem  (Rowbottom  [64,71]).  Suppose  that  k is  measurable  and  U is  a 
normal  ultrafilter  over  k.  Then  if  /:  [/r]<<u  — > y where  y < k,  there  is  a set  in  U 
homogeneous  for  f. 

Proof.  If  for  each  new  there  were  sets  Xn  e U homogeneous  for  /|  [*:]",  then 
n„ea)  X„  e U would  be  as  required.  Thus,  it  suffices  to  establish  the  following  for 
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every  n e co:  for  any  g : [a:]"  -»  y with  y < k,  there  is  a set  in  U homogeneous 
for  g. 

Proceeding  by  induction,  the  n = 1 case  is  clear  from  the  ^-completeness  of 
U.  So,  assume  that  the  assertion  holds  for  n > 1,  and  suppose  that  g:  [a]"+1  ->  y 
where  y < at.  For  each  s e [«■]"  define  gs:  k ->  y by: 

I #0  U {£})  if  max(.v)  < ft  , 

gs(P)  = 

I 0 otherwise  . 

By  /r -completeness,  for  each  s e [a-]"  there  is  a 8S  < y and  a Ys  e U such 
that  gs“Ys  = {<5V}.  By  induction  hypothesis,  there  is  a fixed  S < y and  a Z e 
U such  that  s e [Z]"  implies  that  8S  = S.  Finally,  by  /e-completeness  Za  = 
Pl{Fs  | max(s)  < a}  e U for  each  a < at,  so  that  by  normality  H — Zf\Aa<j<Za  e 
U.  The  proof  is  completed  by  checking  that  g“[^]"+1  = {^}:  Suppose  that 
t e [//]"+',  written  (=jU  \P)  where  max(i?)  < p.  Then  g(t)  — gs(P)  = 8S  — 8 
since  p e Zmax(S)  c Ys  and  s e [Z]".  H 

7.18  Corollary  (Erdos-FIajnal  [58:  125]).  Measurable  cardinals  are  Ramsey.  H 

Rowbottom  also  observed  that  the  least  Ramsey  cardinal  is  not  measurable; 
this  follows  quickly  from  a typical  indescribability  argument: 

7.19  Exercise.  If  k is  measurable  and  U is  a normal  ultrafilter  over  k,  then 

{a  < k | a is  Ramsey}  e U . 

Hint.  There  is  a n\  sentence  <p  such  that  k is  Ramsey  iff  (VK,  e)  [=  tp,  so  the 
result  follows  from  6.5.  H 

In  the  next  section  partitions  of  all  finite  subsets  are  correlated  with  model- 
theoretic  transfer  principles,  and  drastic  consequences  for  L are  deduced. 
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8.  Partitions  and  Structures 

Frederick  Rowbottom,  a student  of  Keisler  at  the  University  of  Wisconsin,  in- 
vestigated partition  properties  of  measurable  cardinals  in  his  doctoral  dissertation 
[64].  An  important  result  of  his  appeared  at  the  end  of  the  previous  section,  and 
here  we  pursue  his  main  line  of  argument,  based  on  a basic  characterization  of 
model-theoretic  transfer  principles.  With  this  conceptual  breakthrough  he  was  able 
to  derive  substantial  statements  about  the  distance  between  L and  V. 

A structure  A = (A,  R,  . ..)  for  a countable  first-order  language  with  a dis- 
tinguished unary  predicate  interpreted  by  R c A is  of  type  (k,  X)  iff  |A|  — k and 
\R\=k. 

{k,X)  —»■  in,v)  iff  whenever  A is  of  type  (k,  X), 

there  is  a B -<  A of  type  (/x,  v)  . 

(k,  X)  — » (/x,  <u>  iff  whenever  A is  of  type  (k,  X),  there  is 

a B < A of  type  (p,  p)  for  some  p < v . 

These  are  evidently  strong,  two-cardinal  versions  of  the  Lowenheim-Skolem  The- 
orem. The  transfer  principle  (at,  X)  — *►  (/x,  v)  is  known  as  Chang’s  Conjecture 
for  the  pairs  (at,  X),  ( /x , v),  after  the  attribution  to  Chen-Chung  Chang  in  Vaught 
[63:  309].  A test  case  was  soon  singled  out: 

Chang’s  Conjecture  is  ( a>2 , co\)  — » {c o\ , co)  . 

Of  course,  this  is  equivalent  to  (a>2,a>\)  — »■  {(0\ , <co\)  since  an  infinite  unary 
predicate  cannot  have  a finite  counterpart  in  any  elementary  substructure.  Gener- 
ally speaking,  the  less  stringent  (p,  <v)  formulation  is  more  immediate  for  large 
cardinal  postulations,  but  further  analysis  can  produce  a range  of  ( /x , v)  conclu- 
sions (cf.  8.5(b)). 

Rowbottom  established  an  equivalence  between  the  model-theoretic  con- 
cept and  partition  relations  somewhat  different  from  the  types  that  have  been  dis- 
cussed. In  a nice  reversal,  this  major  application  preceded  the  introduction  of  the 
partition  symbol: 

First,  the  square  brackets  partition  relation 

P — > [a]rs 

of  Erdos-Flajnal-Rado  [65: 144]  asserts  that  for  any  /:  [f]Y  -»  S there  is  an 
H e [/8]“  such  that  f“[H]Y  i £ That  is,  / on  [ H]Y  omits  at  least  one  value, 
a far  weaker  conclusion  than  for  the  ordinary  partition  relation.  As  with  that 
relation  the  focus  is  mostly  on  the  case  y < a>,  although  Erdos-Hajnal  [66]  was 
able  to  establish  with  a clever  use  of  the  Axiom  of  Choice  that  k— [at]®  for 
any  k (23.13),  a result  that  was  to  impose  an  ultimate  limitation  on  large  cardinal 
hypotheses.  The  main  incentives  in  the  study  of  square  bracket  relations  lie  in  the 
investigation  of  their  possible  negations  as  strong  combinatorial  propositions  (see 
Erdos-Flajnal-Mate-Rado  [84]  and  Todorcevic  [87]). 
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The  version 

P — ► [«]?,<„ 

asserts  that  for  any  /:  [/0]y  ->  S there  is  an  H e [/!]“  such  that  \f“[H~\Y\  < rj. 

Finally,  as  with  the  ordinary  partition  relation  Erdos-Hajnal-Rado  [65:  156] 
considered  stronger  forms: 

P — ► [a]f“ 

asserts  that  for  any  /:  [/S]<fU  ->  S there  is  an  H e [/3]“  such  that  /“[//]<w  f 
and 

p — > \c<, 

has  the  analogous  meaning. 

The  basic  Rowbottom  discovery  can  now  be  stated: 

8.1  Theorem  (Rowbottom  [64,71]).  Suppose  that  k > X and  k > p > v > a>. 
Then  the  following  are  equivalent: 

(a)  {k,  X)  — »■  (p,  < v). 

(b)  K ->  [/*]£"„. 

Proof.  In  the  forward  direction,  suppose  that  /:  [*r]<<w  ->  X.  Consider  the  structure 

A=(K,X,e,f  |[/c]">n€(U  . 

By  hypothesis,  there  is  an  H e [ac]^  with  |A.  (T  H\  < v such  that 
{H,  Xn  H,e,  f\[H]")neoi  -<  A . 

Clearly,  this  H is  as  desired. 

For  the  converse,  we  Skolemize  of  course.  Suppose  that  A — (A.  R.  . . .)  is 
a structure  of  type  (k,  X);  it  can  be  assumed  by  relabeling  that  A = k and  R = X. 
Let  {hn  | n e coj  be  a complete  set  of  Skolem  functions  for  A (§0),  say  with  hn 
k(n)- ary  where  k(n)  < n.  Define  /:  [/c]<w  ->  X by 

!hn(f\ . . . . , ^k(n))  if  this  is  less  than  X , and 

0 otherwise . 

By  (b),  let  H e [/r]M  be  such  that  |/“[//]<w|  < v,  and  set  B = [Jn  h„“[H]kM. 
Then  |B|  = p and  B is  the  domain  of  a structure  {B,  X fl  B. . . .)  -<  A.  Also, 
| A.  D B\  < u as  A D B C.  /“[//]<®,  and  the  proof  is  complete.  H 

It  is  a measure  of  modem  sophistication  that  this  characterization  is  nowadays 
regarded  as  entirely  straightforward,  with  the  two  statements  considered  synoptic; 
of  course,  we  are  standing  on  the  shoulders  of  Skolem  [23]. 

Some  forms  of  these  transfer  principles  have  simpler  combinatorial  equiva- 
lences. In  fact,  before  Erdos,  Hajnal,  and  Rado  developed  their  notations,  they  had 
pondered  ([65:  154])  the  relation  a> 2 — > [&u ]«,.<&),  in  connection  with  a problem 
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of  Ulam,  a relation  that  turns  out  to  be  equivalent  to  Chang’s  Conjecture.  As  an 
aside,  a generalization  of  this  is  established: 

8.2  Proposition  (Erdos-Hajnal  [74:  275]  for  n — 2).  For  1 < n < a>, 

OF  — ► iff  a>„  — > ■ 

In  particular,  Chang’s  Conjecture  holds  iff  co 2 — > [w  1]^  . 

Proof.  The  following  will  be  established  for  new  by  induction: 

(*)  For  any  /:  [ w„]<a>  — »■  tt>i,  there  is  a g:  [«„]"  — >■  a>i  such  that:  for  any 
5 e [w„]<m  with  |s|  > n,  there  is  a t e [s]"  satisfying  max(/“[.v]<<B)  < 
g(t)- 

The  proof  would  then  be  complete,  since  given  such  / with  corresponding  g,  if 
H c &>„  and  g“[H]n  is  countable,  then  sup (f“[H]<ro)  < sup(g“[//]'!)  < w\  and 
so  f“[H]<0>  is  also  countable. 

(*)  for  n — 0 is  immediate,  as  [tt>o]°  = {0}  and  so  g(0)  = sup(ran(/))  works. 
Suppose  next  that  (*)  holds  for  n,  and  /:  [wn+\\<‘°  — >■  w>\.  For  a < wn+\  let 
pa'.  a — > wn  be  injective,  and  define  fa\  [®„]<®  —>  w\  by 

fa(x)  = max( f“[p~\x)  U [o;}]<a>)  . 

By  the  induction  hypothesis,  let  ga:  [ w,,]"  ->  w\  verify  (*)  for  fa.  Finally,  define 
g-  K,+i]"+1  ->  o>\  by 

g(t)  = ga(Pa“(t  ~ {«}))  where  a = max (7)  . 

If  |s|  >n  + l where  a — max(^),  then 

max(/“[i]<w)  = fa(pa“(s  - {a}))  < ga(y)  for  some  y e [pa“(s  - {a})]" 

= g(t)  where  t = p~\y)  U {a}  , 

and  so  the  proof  is  complete.  H 

The  exponent  here  is  best  possible  in  the  sense  that  with  GCH,  &>„  — >- 
by  the  Erdos-Rado  Theorem  7.3.  Todorcevic  [94]  established  in  1991 
that  Chang’s  Conjecture  is  in  fact  equivalent  to  u> 2 — > [<^1]®!;  the  exponent 
here  is  best  possible  in  the  sense  that  with  CH,  7.3  implies  that  &>2  — > (“llL 
and  so  o>2  — > Silver  established  in  1967  that  if  there  is  a k satisfying 

k — > (ujijj®,  then  there  is  a forcing  extension  in  which  Chang’s  Conjecture 
holds  (see  Kurepa’s  Flypothesis  and  Chang’s  Conjecture  in  volume  II). 

Rowbottom  Cardinals 

The  conspicuous  feature  of  measurable  cardinals  that  concerned  Rowbottom  is  the 
following.  For  w < v < k. 
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k is  v-Rowbottom  iff  k — > for  any  X < k , and 

k is  Rowbottom  iff  k is  to\  -Rowbottom  . 

By  7.14(c),  a Ramsey  cardinal  is  Rowbottom. 

Gaifman  (see  9.1)  and  Rowbottom  independently  and  almost  concurrently 
established  the  first  informative  structural  results  about  the  incompatibility  of  large 
cardinals  with  V — L.  Whereas  both  Scott  (5.5)  and  Gaifman  made  global  use  of 
ultrapowers,  Rowbottom  showed  how  strong  local  conclusions  can  be  derived  from 
partition  properties  alone.  The  following  theorem  typifies  his  results;  although 
they  were  soon  subsumed  by  the  definitive  work  of  Silver  (§9),  Gaifman  and 
Rowbottom  were  first  to  make  such  remarkable  statements  about  L and  the  low 
levels  of  the  cumulative  hierarchy.  The  proof  is  notably  short,  benefiting  somewhat 
from  hindsight: 

8.3  Theorem  (Rowbottom  [64,71]).  Suppose  that  there  are  k > X > to  and 
k > n > to  such  that  (k,  X)  — (/x,  < oof).  (For  instance,  suppose  that  there 
is  a Rowbottom  cardinal,  or  there  is  a k satisfying  k — > (ft>i)J®,  or  Chang’s 
Conjecture  (a>2,  to i)  — » {to i,  to)  holds.)  Then  to\  is  inaccessible  in  L,  and  so  in 
particular  V(to)L  is  countable. 

Proof.  It  suffices  to  show  for  any  a < to\  that  V(a)L  is  countable,  for  then  an 
being  regular  must  be  inaccessible  in  L. 

Given  such  an  a,  let 


•Xl  — (Lk,  ft)  fi<a  i 

a structure  in  a countable  language.  By  hypothesis,  there  is  a 

B = {X,  R,  e,  f)p<a  -<  A 

with  |X|  = /x  and  \R\  < a>\.  By  0.4,  B has  a transitive  collapse  T,  and  by  3.3(a), 
T is  of  form 

T — (Ly,  S , €,  f) p<a 

for  some  y > gt.  Let  i be  the  inverse  of  the  collapsing  isomorphism,  so  that 
i:  T -<  A with  iff)  = f for  ft  < a.  If  x e V(a)  DLy,  this  implies  that  i(x)  — x, 
and  since  V(a)L  c Lm  c Lx,  it  follows  that  x e 5 by  the  elementarity  of  i.  But 
also  V(ol)  fl  Ly  = V{o/)L  since  y > /x  > to,  and  so  V(a)L  C S and  is  therefore 
countable.  H 

Historically,  Rowbottom’ s first  insight  was  the  conclusion  that  any  ordinal 
definable  in  L (like  \V(to)\L , \V(V(to))\L,  and  so  forth)  is  countable. 

As  one  might  have  guessed,  the  restriction  to  countable  languages  in  8.1 
and  8.3  is  not  essential.  Rowbottom  made  a general  statement  along  the  lines  of 
8.1  about  getting  homogeneous  sets  simultaneously  for  collections  of  functions 
corresponding  to  many  Skolem  functions,  and  then  observed  that  Ramsey  cardi- 
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nals  provide  the  requisite  strength.  The  following  are  useful  variants  with  single 
functions: 

8.4  Theorem  (Rowbottom  [64,71]).  Suppose  that  (A,  R, . . where  R c A,  is 
a structure  for  a (first-order)  language  with  less  than  v non-logical  symbols,  and 
\A\>  k and  |R|  = X < k.  Assume  that  either 

(a)  X<v  < k and  k is  v-Rowbottom,  or 

(b)  X — v is  regular  and  k — > Mv  <v- 

Then  there  is  a ( B , R fl  B, . . .)  -<  (A,  R,  . . .)  such  that  \B\  = k and  fl  B\  < v. 

Proof.  Assuming  that  A 3 k and  R — X by  relabeling  and  following  the  proof 
of  8.1,  let  {ha  | a < p}  be  a complete  set  of  Skolem  functions  for  the  language, 
where  p < v and  ha  is  k(ot)- ary. 

First  assume  (a).  Let  / be  the  function  on  [/r]<<u  defined  by 

/(I  l ,$„)  = {K($  i,  \ a < p A n = k(a)  A ha(fi\ ?„)  e A.}  . 

Thus  |ran(/)|  < Xp,  and  so  there  is  an  H e [/c]K  such  that  \f“[H]<0)\  < v.  Set 
B = Ua<p  Then  (B,  XHB, ...)  < (A,  X, . . .),  and 

ia.  n b\  < vriHrn  < p ■ \rwrn  < v . 

Assuming  (b)  instead,  define  / on  [k]<<0  from  the  / above  by 
7(§i,...,£„)  = sup(/(fi,...,£„)). 

Then  ran(/)  c v by  the  regularity  of  X — v.  The  argument  can  now  be  completed 
as  before,  invoking  the  regularity  of  v again  to  insure  for  the  corresponding  B 
that  | A fl  B\  < v.  -\ 

8.5  Corollary.  Suppose  that  k is  a Ramsey  cardinal.  Then: 

(a)  For  any  infinite  a < k,  \V(a)L  \ = \a\,  and  so  every  uncountable  regular 
X < k is  inaccessible  in  L. 

(b)  For  any  v < X < at,  (k,  X)  (k,  v). 

Proof,  (a)  This  follows  from  the  proof  of  8.3  using  uncountable  languages. 

(b)  Given  a structure  (A,  R, . . .)  of  type  {k,  X)  (in  a countable  language),  let 
S e [R]v  and  consider  the  expansion  {A,  R, , x)x€s.  Applying  8.4  with  its  v 
replaced  by  v+,  this  expansion  has  an  elementary  substructure 

(B,  R fl  B,  . . . , bx)x€s 

such  that  \B\  — k and  \R  IT  B\  < v.  But  then,  \R  fl  B\  — v because  of  the  bx’s, 
and  the  reduct  (B.  R fl  B. . . . ) to  the  original  language  is  of  type  {k,  v).  H 

The  postulation  of  Rowbottom  cardinals  marked  a historic  reversal,  in  that 
it  was  designed  for  strong  implications  like  8.3  but  does  not  appear  to  have  sub- 
stantial size  consequences.  The  only  evident  constraint  is  the  following: 
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8.6  Exercise.  For  any  X, 

(a)  ^+— fi 

(b)  X — k [A.]*f(A)  <cf(1). 

Hence,  a v-Rowbottom  cardinal  k is  either  weakly  inaccessible  or  has  cofinality 
less  than  v. 

Hint,  (a)  For  each  B < X+  let  fg:  B —>  X be  injective,  and  consider  e:  [A+l2  —>  X 
defined  by:  g(a,  /))  — fp(ot). 

(b)  Let  (ya  \ a < cf (at))  be  cofinal  in  k,  and  consider  h:  X — > cf(A)  defined 
by:  h (f ) = the  least  a such  that  f < ya . H 

Karel  Prikry  showed  that  singular  cardinals  can  exhibit  strong  partition  prop- 
erties: 

8.7  Theorem  (Prikry  [70]).  Suppose  that  k is  a singular  limit  of  measurable 
cardinals.  Then  k is  cf(Ar)+ -Rowbottom. 

Proof.  Set  v — cf (k),  and  let  (Ka  \ a < v)  be  the  increasing  enumeration  of  a set 
closed  unbounded  in  k with  kq  — v and  /cff+i  measurable  for  each  a < v.  Let 
Ua+ i be  a normal  ultrafilter  over  /<■„+!  for  a < v,  and  define  D by: 

X e D iff  X c.  k A < vVaf3  < a < v — > X fi  Ka+i  e Ua+i)  , 

so  that  D is  a uniform  filter  over  k.  For  .y  e [k]<oj  set 

type(i')  = {(a,  k)  € v x w \ |s  fl  (Arff+1  — Ka)\  = k > 0}  . 

The  following  will  be  established: 

(*)  For  any  /:  [at] <co  -»  y with  y < k there  is  an  H e D such  that: 
for  any  e [H]<a>  with  type(s0)  = type(ii),  f(s0)  = f(sfi  . 

With  at  most  v possibilities  for  typefs)  it  will  follow  that  \f“[H]<a>\  < v,  and 
hence  that  k is  u+-Rowbottom. 

Consider  for  new  the  assertion: 

(*)„  For  any  /:  [at]<<1j  -*  y with  y < k there  is  an  H C k such  that: 

(i)  for  every  /cff+i  > y,  H IT  k„+i  e Ua+i  ■ and 

(ii)  for  any  so,  e [H]<a>  with  type(so)  = type(.vi)  and 

|type(^0)|  = n,  f(s0 ) = /(si)  . 

If  /:  [/r]<w  —>  y with  y < k and  for  each  new  there  is  an  Hn  satisfying  (*)„  for 
/,  then  by  (i),  P|n  H„  e D , and  by  (ii),  (*)  is  satisfied  with  H = H„.  Thus, 

it  suffices  to  establish  (*)„  for  every  n e w by  induction: 

(*)o  holds  vacuously,  so  proceeding  to  the  inductive  step,  suppose  that  (*)„ 
holds,  and  let  /:  [/c]<<u  ->  y with  y < k.  For  a < v with  > y and 

t e [/Ca]<a\  define  f,:  [Ar„+i]<<u  — >■  y by:  ft(u ) = f(t  U u ).  By  7.17  there  is  an 
X,  e Ua+ 1 homogeneous  for  ft.  For  a < v with  k„+  i > y,  set 
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xa  = 1 1 e k]<w}  - Ka 


so  that  Xa  e Ua+ 1 by  *:„+! -completeness. 

Next,  for  any  (a,  k)  e v x a>  define  fay.  [ k~\<0}  ->  y by 


fc,k(t) 


f(t  U u)  if  t e [ku\<0>,  where  u is  some 
(any)  member  of  [ X „]*  , and 

0 otherwise  . 


fa.k  is  well-defined  by  definition  of  Xa.  Let  Ha  k c k be  for  fa  k as  provided  by 
(*)/!• 

Finally,  set 


H = I 0/  < V A Y < Ka+  1}  n n {Ha.k  I (a,  k)  e V x co)  . 

Then  for  Ka+\  > y,  H C ^ Ka+\  e Ua+i  by  v+-completeness.  Suppose  now  that 
so,  si  e [H]<0i  with  type(so)  = type(si)  and  |type(so)|  =n  + 1.  Let  a be  maximal 
such  that  (a,  k)  e type(so)  for  some  k,  and  set  s0  = t0  U uq  and  si  = t\  U u\  where 
type(fo)  = type (t i ) with  |type(to)|  = n and  type(«o)  = type(Mi)  = {(a,  k)}.  Then 
/(so)  = fa.k(t0)  — fa,k(h)  = /(Si),  confirming  (*)n+i  for  /.  H 


Prikry  also  showed  that  a measurable  cardinal  can  be  maintained  a Rowbot- 
tom  cardinal  while  changing  its  cofinality  to  a>  by  forcing  (18.6).  However,  the 
following  is  a major  question  that  has  remained  impenetrable. 

8.8  Question.  Can  be  Rowbottom? 

For  related  results,  see  8.16  and  remarks  surrounding. 

Several  observations  about  how  v-Rowbottom  cardinals  constrain  power  set 
cardinalities,  although  not  difficult,  were  made  considerably  later. 

8.9  Proposition  (Prikry  [75]).  Suppose  that  2<y  = k and  k is  a regular  v- 
Rowbottom  cardinal.  Then  for  v < X < k,  2x  — k. 

Proof.  As  v < k — cf (k),  there  is  a p.  < v such  that  2;<  = k.  Proceeding  by 
induction,  suppose  then  that  X < k and  for  p < X'  < X,  2}  — k.  If  A is  singular, 
a well-known  argument  establishes  that  2X  = k:  Let  (ya  \ a < cf(A))  be  cofinal 
in  X,  and  for  X c X let  fx  on  cf(A)  be  defined  by:  /x(a)  = X (T  ya.  Then  X fY 
implies  that  fx  f /y,  and  so  2X  < (2<A-)cf(A)  = 2<x  = k. 

Suppose  next  that  X is  regular.  Similarly  considering  fx  on  X for  X c X 
defined  by  fx(oi)  — X (T  a,  2<x  — k implies  that  there  is  a family  T of  2X 
functions:  X -a  k that  are  almost  disjoint,  i.e.  for  / / g both  in  T , /( a)  f g(a) 
for  sufficiently  large  a < X. 

Assume  now  to  the  contrary  that  2 x > k.  Since  k is  regular,  for  each  fed7 
there  is  a Sf  < k such  that  ran (/)  C 8f.  Consequently,  there  is  a S < k and 
a Q C T with  \Q\  = k such  that  / e Q implies  that  8f  — 8.  Enumerating 
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Q as  {/|  | § < k),  define  F:  [/c]2  ->  X by:  F(§ , £)  = the  least  ft  such  that 
fa  (a)  ^ /f(a)  for  /)  < a < A.  Since  k is  v-Rowbottom,  there  is  an  H e [*:]* 

such  that  |F“[//]2|  < v.  Now  X is  regular,  and  the  assumption  2X  > k together 

with  the  hypothesis  2<v  = k implies  that  v < X,  and  so  y — sup(F“[//]2)  < X. 

But  then,  {/|(y)  | % e H)  consists  of  k ordinals  all  less  than  X,  which  is  a 

contradiction.  H 

It  follows  that  if  2V  is  v+ -Rowbottom,  then  for  v < X < 2V,  2X  = 2V . The 
following  result  complements  8.9. 

8.10  Proposition  (Tryba  [86]).  Suppose  that  2<v  < k,  and  k is  v-Rowbottom. 
Then  k is  a strong  limit  cardinal. 

Proof  Assume  to  the  contrary  that  X < k yet  2 x > k.  Arguing  as  for  8.1,  let 
<M,e>  -<  (VK+l0,e)  be  such  that  \M\  — k — \M  fl  k\  and  \M  fl  A]  < v.  Let 
N be  the  transitive  collapse  of  M,  and  j : ( N , e)  -<  (K+®,  e)  the  inverse  of  the 
collapsing  isomorphism,  so  that  j(jc)  = k and  j(S)  — X for  some  S < v.  By 
assumption  there  is  an  injection:  k ->  V{X)  in  VK+to,  and  so  there  is  an  injection: 
k —*■  V(S)  in  N.  But  this  contradicts  the  hypothesis  2<v  < k.  H 

In  particular,  if  cow  is  Rowbottom  and  2s'0  < coa,,  then  a>[0  is  a strong  limit  car- 
dinal. Further  results  along  these  lines  appear  in  Tryba  [86].  A similar  argument 
establishes  the  following: 

8.11  Exercise  (Tryba  [81]).  If  k is  v-Rowbottom  and  there  is  a limit  cardinal  p 
such  that  v < p < k,  then  k is  a limit  of  limit  cardinals. 

Hint.  Assume  to  the  contrary  that  X is  the  largest  limit  cardinal  less  than  k.  Let 
j:  (A,  e)  -<  {VK+l0i  e)  be  as  in  the  proof  of  8.10  for  this  X.  Now  note  that  p is  a 
limit  cardinal  in  M to  derive  a contradiction.  H 

It  follows  that  cOio+a,  is  not  Rowbottom,  and  in  fact  the  least  Rowbottom  car- 
dinal > coco  is  > &)«.<«.  Hypothesizing  that  p is  an  inaccessible  cardinal,  a Mahlo 
cardinal,  etc.,  lead  to  analogous  results  about  the  size  of  k.  Also,  the  same  means 
establish  that  if  k is  v-Rowbottom  and  cov  < k,  then  k — a>K. 

Little  else  is  known  about  v-Rowbottom  cardinals  combinatorially  or  consist- 
ency-wise which  is  not  already  a consequence  of  a related  concept,  to  which  we 
now  turn. 

Jonsson  Cardinals 

An  initiative  from  a different  quarter  completed  the  transition  from  the  study 
of  ordinary  partition  relations  to  the  consideration  of  an  intrinsically  interesting 
algebraic  problem.  Bjami  Jonsson,  a student  of  Tarski  and  a prime  exponent  of 
transfinite  universal  algebra,  posed  in  1962  the  following  problem  (see  Jonsson 
[72:  3.9]  for  the  context):  For  a cardinal  k,  does  every  algebra  of  cardinality  k 
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have  a proper  subalgebra  of  the  same  cardinality?  Here,  an  algebra  is  a structure 
A — (A,  f„)nei,i  where  for  each  n,  fn:  [A]k(-n)  — > A for  some  k(n)  e co;  and  a 
subalgebra  of  it  is  a structure  of  form  Ao  — {Aq,  fi,\[Ao]k(n))n€w  where  Ao  C 
A and  /«“[Ao]a'('!)  c Ao.  Thus,  these  are  standard  model-theoretic  notions  for 
a countable  language  with  only  function  symbols.  Jonsson  queried  specifically 
about  algebras  with  only  finitely  many  operations,  but  the  difference  is  inessential 
because  of  coding.  A Jonsson  algebra  is  an  algebra  without  a proper  subalgebra 
of  the  same  cardinality.  A Jonsson  cardinal  is  one  that  has  no  Jonsson  algebras 
of  that  cardinality,  or  stated  affirmatively, 

k is  Jonsson  iff  every  algebra  of  cardinality  k has  a 

proper  subalgebra  of  the  same  cardinality  . 

Jonsson’s  problem  is  thus:  Are  there  any  Jonsson  cardinals? 

As  with  Rowbottom  cardinals,  Jonsson  cardinals  were  characterized  combi- 
natorially  and  model-theoretically: 

8.12  Exercise  (Erdos-Hajnal  [66]  for  (b),  Keisler-Rowbottom  [65]  for  (c)).  The 
following  are  equivalent: 

(a)  k is  Jonsson. 

(b)  K — ► [*]<“. 

(c)  Any  structure  for  a countable  first-order  language  with  domain  of  cardi- 
nality k has  a proper  elementary  substructure  with  domain  of  the  same  cardinality. 

Hint.  For  (a)  ->  (b),  if  /:  [Ar]<<u  —*■  k,  consider  the  algebra  A = (k,  f\[K]")nea>- 
(b)  -*  (c)  is  as  for  8.1.  H 

Keisler  and  Rowbottom  thus  saw  that  if  k is  v-Rowbottom  for  some  v < k, 
then  k is  Jonsson . By  Skolemizing,  they  were  able  to  incorporate  predicates  and 
elementary  substructures  into  Jonsson’s  problem,  and  establish  an  extension  of 
Scott’s  original  result  on  L: 

8.13  Proposition  (Keisler-Rowbottom  [65]).  If  there  is  a Jonsson  cardinal,  then 
V L. 

Proof.  Let  k be  Jonsson,  and  by  8.12(c)  let  ( B , e)  -<  ( LK , e)  with  \B\  — k and 
B f Lk.  By  0.4,  ( B , e)  has  a transitive  collapse,  and  by  3.3(a),  it  must  be  (LK,  e). 
Let  i:  Lk  -<  LK  then  be  the  inverse  of  the  collapsing  isomorphism.  Since  B LK, 
i cannot  be  the  identity  and  so  has  a critical  point  S by  the  argument  for  5.1(b). 
Now  define  U by: 


X e U iff  X e V(S)L  a S e i(X)  . 

Since  V(S)L  C L$+  C LK,  if  V = L,  then  (cf.  5.6)  U is  a 5-complete  ultrafilter 
over  S and  hence  that  <5  is  a measurable  cardinal.  But  this  contradicts  Scott’s  result 
5.5.  H 
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The  conclusion  here  is  weaker  than  for  8.3,  but  in  any  case  these  results 
were  soon  to  be  subsumed  by  the  incisive  work  of  Kunen  (21.4)  who  showed  that 
the  strongest  conclusions  in  this  direction  can  be  derived  from  these  hypotheses. 
He  also  showed  (20.23)  that  it  is  consistent  with  the  existence  of  a measurable 
cardinal  that  a cardinal  is  Jonsson  iff  it  is  Ramsey.  However,  even  less  is  known 
in  ZFC  about  the  size  of  Jonsson  cardinals  than  of  Rowbottom  cardinals,  and 
this  has  become  a major  area  of  investigation;  the  initial  observations  were  the 
following: 

8.14  Proposition. 

(a)  oo  is  not  Jonsson. 

(b)  (Chang,  Rowbottom,  Erdos-Hajnal  [66])  If  k is  not  Jonsson,  neither  is 

K+ . 

(c)  (Erdos-Hajnal-Rado  [65:  145])  If  2K  = k+,  then  k+  —fi  [/c+]2+  and  hence 
k+  is  not  Jonsson. 

(d)  (Rowbottom  - Devlin  [73:311])  The  least  Jonsson  cardinal  is  either 
weakly  inaccessible  or  singular  of  cofinality  oo. 

Proof,  (a)  If  / e “&>  is  defined  by  /( 0)  = 0 and  f(n)  = n — 1 otherwise,  then 
(oo,  f)  is  a Jonsson  algebra. 

(b)  For  k < a < k+ , using  a bijection  let  fa  witness  a— fir  [/c]^M.  Define 
g:  [K+]<a)  ->  k+  by:  g(s ) = fa (s  — [a])  if  a — maxfy)  > k,  and  g(s)  = 
0 otherwise.  Then  g witnesses  /r+— /*[k+]<®:  If  X e [at+]'c+  and  p < k+ 
is  arbitrary,  let  a e X such  that  ft  < a and  |X  (T  ct|  = k.  Then  for  some 
t e [X  n a]<<0,  p = fa(t ) = g(r~{a}). 

(c)  Using  2K  = k+  let  { Xa  \ k < a < *c+}  = [^+]'f  with  Xa  C a.  Then  there 
is  an  /:  [/c+]2  ->  k+  so  that  for  any  k < P < k+ , 

(*)  if  k < a < p and  < p,  there  is  a f such  that  /(f , /!)  = ?;. 

To  show  this,  for  such  p let  {(a,-,  ?;,■}  i < k]  = {(a,  p)  | k < a < P A r]  < P), 
recursively  define  \ i < k}  so  that  e Xaj  — [f7-  | j < ;},  and  then  stipulate 
that  / satisfy  /(§,,  P)  = r)t  for  each  i < k. 

To  verify  that  / witnesses  k+— fir  [k+]2+,  let  S e [/r+]'c+  and  ?]  < k+  be 
arbitrary.  Take  any  a < k+  so  that  Xa  C S,  and  let  p e S — (max([i7,  a})  + 1). 
Then  (*)  implies  that  there  is  a f € Ia  such  that  /(£,  P)  = 

(d)  Suppose  that  k is  the  least  Jonsson  cardinal.  Then  k is  an  uncountable 
limit  cardinal  by  (a)  and  (b),  so  assume  to  the  contrary  that  oo  < cfiV)  = X < k. 
Let  (na  | a < X)  be  an  increasing  sequence  of  cardinals  closed  and  unbounded 
in  k with  X < /r o-  For  each  a < X let  fa  witness  na—fi>-  [jJ-aVff,  and  define 
/:  [k]<w  -»  k by: 


ifa(s  — [a])  if  a — min(s)  < X and  max(s)  < , and 

0 otherwise . 
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Also,  let  g:  [k]<co  — »■  k be  any  extension  of  a function  witnessing  X— />[L]]j®, 
and  let  h:  k ->  X be  defined  by  h(^)  = that  a such  that  na  < i;  < \jl a+i . 

We  shall  show  that 


{K,f\[K]\g\[K]n,h)neco 

is  a Jonsson  algebra,  arriving  at  a contradiction.  So,  suppose  that  X e [k]'1  is  the 
domain  of  a subalgebra.  \X  n A]  — X because  of  h,  and  so  X fi  X — X because 

of  g.  Assume  now  that  f < k is  arbitrary.  Let  a0  < X such  that  § < fjtao, 

and  by  recursion  choose  an  < an+\  < X such  that  \X  n fia„+l\  > da„-  Set 

ft  = sup([a„  | n e co})  so  that  /l  < X as  X — cffic)  > a>.  Then  \X  fl  (ip\  — [ip 

since  the  /u„’s  form  a closed  set  of  ordinals.  Hence,  fp“[(X  fl  fip)  — A]<<u  = \xp, 
and  so  by  definition  of  / and  the  fact  that  /3  e X = X C \X,  % e X.  Hence  X — k, 
and  with  this  contradiction  the  proof  is  complete.  H 

The  proof  of  (b)  was  considerably  elaborated  by  Shelah  [80]  to  show  that 
there  is  a Jonsson  algebra  of  cardinality  Ki  which  is  a group,  answering  an  old 
question  of  Kurosh. 

Eugene  Kleinberg  observed  that  Jonsson  cardinals  have  Rowbottom-like  prop- 
erties: 

8.15  Proposition  (Kleinberg  [73]). 

(a)  If  k is  Jonsson,  then  there  is  an  a < k such  that  k — > Ma®-  If  8 is  the 
least  such  a,  then  8 is  regular  and  k — >■  [^]^<^. 

(b)  If  k — > and  X is  not  Jonsson,  then  k — >•  [/r]][<^. 

(c)  The  least  Jonsson  cardinal  k is  v-Rowbottom  for  some  v < k;  in  fact,  v 
can  be  taken  to  be  the  8 of  (a)  for  the  Jonsson  cardinal. 

Proof,  (a)  Assume  that  for  each  a < k,  there  is  an  fa:  [k]<<u  — > a witnessing 
Then  it  is  simple  to  check  that  /:  [/c]<<u  -*  k defined  by  f(s)  — 
fa(s  — [a])  where  a — min(.?)  (and  /(0)  = 0)  witnesses  k— />[k],7®,  which  is  a 
contradiction. 

To  verify  the  assertion  about  8,  it  suffices  to  show  that  k — > Mens)  <cf(8)- 
So,  assume  to  the  contrary  that  this  has  a counterexample  g.  Let  e:  cf(5)  — >■ 
8 be  cofinal,  and  fa  for  a <8  as  above.  Define  h:  [k]<<w  -»  8 by  setting 
h(^ i,  . . . , = 0 unless  n — 2'V  for  some  i,  j > 0,  in  which  case 


h(M  1 Hn)  = /e(g({1,...,&))(?i+ li  • • • . Hi+j)  ■ 

Then  it  is  straightforward  to  check  that  h witnesses  tc—j ^[a:]^®,  which  is  a con- 
tradiction. 

(b)  We  shall  verify  that  ( k,X ) — (k,  <X)  (cf.  8.1).  So  suppose  that  S = 
(A,  R, . ..)  is  a structure  of  type  (at,  X)  where  it  can  be  assumed  that  A — k and 
R — X,  and  let  g be  any  function:  [/c]<®  ->  k extending  a function  witnessing 
X— ■f*[X\£a> . Expand  S to  a structure  A — (k,  X, . . . , g| [*:]"}„<=<«,  and  define  a 
function  /:  [/r]<w  ->  X just  as  for  8.1  from  a complete  set  of  Skolem  functions 
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\h„  | n e &)}  for  A.  By  hypothesis  there  is  an  H e [k~\k  such  that  f“[H]<a>  f X. 
Set  A0  = U„  hn“[H]kM,  so  that 

Aq  — (Aq,  X n Ao, . . . , g|[Ao]'!>„€(U  -<  A . 

If  |lfl  Ao|  = X , then  A.PI  Ao  = X because  of  g,  contradicting  f“[H]<a>  f X.  Hence, 
the  reduct  of  Ao  to  the  original  language  without  g is  an  elementary  substructure 
of  S of  type  ( k , p)  for  some  p < X. 

(c)  This  follows  in  a straightforward  manner  from  (a)  and  (b).  H 

By  8.14(a)(b),  is  ostensibly  the  least  possibility  for  a Jonsson  cardinal. 
Whether  this  can  be  realized  is  unresolved,  and  is  essentially  the  same  question 
as  8.8  by  8.15(c)  and  the  proof  of  the  next  result,  which  will  be  deferred  until  our 
forcing  context  has  been  established  (10.18). 

8.16  Theorem  (Kleinberg  [72,79]).  The  following  theories  are  equiconsistent: 

(a)  ZFC  + 3 k{k  is  Rowbottom). 

(b)  ZFC  + 3 k(k  is  Jonsson). 

Devlin  [73]  also  observed  with  simple  forcing  arguments  that:  (i)  Con  (ZFC 
+ 3 k{k  is  Jonsson))  implies  Con(ZFC  + 3 k(k  is  Jonsson  A k < 2X°)),  and  (ii) 
Con(ZFC  + 3 k{k  is  Ramsey))  implies  Con(ZFC  + 3 k(k  is  Jonsson)  + rthe  least 
Jonsson  cardinal  is  not  Rowbottom1).  Then  Silver  in  1974  established  with  an 
elegant  argument  that  if  toco  is  Jonsson  and  2K°  < then  <x>m  is  measurable  in 
an  inner  model  (see  Kanamori-Magidor  [78:  129ff]  for  a proof). 

After  this  work,  save  for  a paper  Shelah  [78]  there  was  a hiatus  in  the  study 
of  Jonsson  cardinals  per  se  until  a resurgence  of  interest  in  the  1980’s.  Then 
developments  in  inner  model  theory  established  that  the  existence  of  accessible 
Jonsson  cardinals  has  substantial  large  cardinal  consequences.  Extending  results 
of  Donder-Koepke  [83],  Koepke  [88]  showed  for  example  that  if  there  is  a Jonsson 
cardinal  k such  that  k — for  some  $ < k or  k is  regular  but  not  K-Mahlo,  then 
for  any  X there  is  an  inner  model  with  X measurable  cardinals.  He  also  showed 
that  Prikry’s  8.7  is  sharp  in  consistency  strength  for  uncountable  cofinalities:  if 
there  is  a Jonsson  cardinal  k such  that  u>  < cf(/r)  < k,  then  there  is  an  inner  model 
with  cf(/r)  measurable  cardinals.  Then  Koepke  [89]  established  that  the  existence 
of  a successor  Jonsson  cardinal,  the  most  acute  possibility,  has  far  stronger  con- 
sequences than  having  many  measurable  cardinals.  This  complemented  important 
developments  concerning  such  Jonsson  cardinals: 

First  came  the  following  observation: 

8.17  Proposition  (Tryba  [84],  Woodin).  Suppose  that  X is  regular  and  there  is  an 
S stationary  in  X such  that  S (T  a is  not  stationary  in  a for  any  limit  ordinal  a < X. 
Then  X is  not  Jonsson.  In  particular,  if  k is  regular,  then  k+  is  not  Jonsson. 

Proof.  By  a well-known  result  of  Solovay  that  will  be  later  established  in  context 
(16.9)  there  is  a partition  S — of  S into  disjoint  stationary  sets  Sn. 


8.  Partitions  and  Structures 


97 


Let  f:S  ->  A be  defined  by:  /(§)  = that  rj  such  that  f e S,r  Next,  let 
(A,  e)  -<  (Vx+W,  e>  with  A U [S',  /}  c A and  | A|  = A,  and  g\  A — >■  A a bijection. 
Now  consider  the  structure 


(A,e,{5},/,g>  . 

Because  of  g,  whenever  B — {B,  e,  {5},  /|B,  g|B)  -<  A,  |B|  = A iff  \B  fi  A|  = A, 
and  B — A iff  A C B.  So,  to  establish  that  A is  not  Jonsson  it  suffices  to  show 
by  8.12(c)  that  for  such  B,  \B  fl  A|  = A implies  that  A C B: 

Let  C = [f  < A | sup(B  I~l  £)  = §},  a set  closed  unbounded  in  A since 
\B  fl  A | = A.  Then  C fl  S C B:  Assume  to  the  contrary  that  § e (C  fl  S)  — B. 
Let  a be  the  least  member  of  B fl  A above  f . Then  a must  be  a limit  ordinal 
as  {B,  e)  -<  {Vx+W,  e)  implies  that  B is  closed  under  ordinal  predecessors.  So 
S e B,  (B,  e)  -<  {V\+lo,  e),  and  the  hypothesis  on  S imply  that  there  is  a D e B 
such  that  D is  closed  unbounded  in  a in  the  sense  of  B and  DHS  — 0.  But  then, 
| e C and  sup(B  fl  a)  — § implies  that  D H B D ^ is  unbounded  in  §,  reaching 
the  contradiction  £ e D fl  S. 

For  any  ?;  < A,  it  follows  that  C fl  c B,  and  since  C fl  St]  f 0,  there  is  a 
f e S,  fl  B.  But  then,  g(§)  = i]  e B.  Hence,  A c B. 

The  last  assertion  follows  by  taking  S = {a  < k+  \ cf(a)  = at } : For  any  limit 
ft  < k+  there  is  a set  closed  unbounded  in  f consisting  of  ordinals  of  cofinality 
less  than  k,  so  that  S fl  (3  is  not  stationary  in  f.  H 

With  some  elegant  combinatorics,  Todorcevic  improved  this  to  the  denial  of 
a partition  relation  for  pairs: 

8.18  Theorem  (Todorcevic  [87: 285]).  Suppose  that  A is  regular  and  there  is  an  S 
stationary  in  A such  that  S fl  a is  not  stationary  in  a for  any  limit  ordinal  a < A. 
Then  A — j>  [A]^.  In  particular,  if  k is  regular,  then  k+ — fr  [*c+]^+. 

Note  that  this  considerably  strengthens  the  implications  4.6(a)  and  7.8(d)  from 
weak  compactness.  It  also  shows  that  the  assumption  2K  — k+  is  unnecessary  in 
8.14(c)  when  k is  regular.  Whether  or  not  an— had  been  a prominent 
problem  of  infinitary  combinatorics  for  two  decades  (see  Problem  15,  Erdos-Hajnal 
[71])- 

The  question  of  whether  k+  can  be  Jonsson  when  k is  singular  (without 
assuming  2K  — k+)  remained,  and  seemed  quite  difficult.  Although  Shelah  [78] 
and  Tryba  [86,87]  imposed  several  constraints,  the  focal  case  k — a>w  seemed 
destined  to  be  resolved  eventually  by  a positive  consistency  result  relative  to  some 
strong  hypothesis.  So,  it  was  somewhat  unexpected  when  Shelah  established  in 
1988  that  unless  a singular  cardinal  k satisfies  some  stringent  conditions,  k+  is 
not  Jonsson.  This  was  one  of  the  many  fruits  of  Shelah’s  rich  pcf  theory  for  the 
study  of  powers  of  singular  cardinals;  see  Burke-Magidor  [90]  or  Jech  [92]  for  an 
exposition.  The  following  theorem  is  illustrative;  the  hypothesis  can  be  weakened 
further. 
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8.19  Theorem  (Shelah).  Suppose  that  a singular  cardinal  k is  not  the  limit  of 
regular  Jonsson  cardinals.  Then  tc+  is  not  Jonsson.  In  particular,  cow+ 1 is  not 
Jonsson. 

Todorcevic  also  refined  this  to  the  partition  relation  for  pairs;  see  Burke- 
Magidor  [90:224]. 

8.20  Theorem  (Todorcevic).  Suppose  that  a singular  cardinal  k is  not  the  limit 
of  regular  cardinals  X satisfying  X — > [a]?.  Then  k+ — f [tr+]^+.  In  particular, 
o>co+i—f> [a)m+ 1]^- 

The  overall  question  still  remains  open: 

8.21  Question.  If  k is  singular,  does  k+ — f [k+]1+? 

As  was  elaborated  in  some  detail,  interest  in  combinatorial  ramifications  of 
Rowbottom’s  work  has  persisted  to  the  present  day,  primarily  because  of  the 
enticing  possibility  that  strong  partition  properties  can  obtain  in  the  low  levels 
of  the  cumulative  hierarchy.  We  now  return  to  the  mid-1960’s,  and  Silver’s 
dramatic  amplification  concerning  L based  on  a concept  that  was  just  coming  into 
prominence:  set  of  indiscernibles  for  a structure. 
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9.  Indiscemibles  and  0# 

Jack  Silver  was  first  a student  at  Berkeley  and  then  joined  its  faculty  in  the  1960’s, 
a time  of  great  activity  there  in  set  theory.  His  remarkable  results  on  0#  (“zero 
sharp”)  appeared  in  his  1966  dissertation  [66],  rivaled  only  by  Kunen’s  (see  §19) 
of  a couple  of  years  later  for  its  impact  on  the  development  of  set  theory.  This 
section  is  devoted  to  Silver’s  results.  §14  discusses  the  definability  of  0#  and 
related  issues  about  the  forcing  method,  a line  of  inquiry  concurrently  initiated 
by  Solovay.  He  and  Silver  independently  isolated  0#  in  the  context  of  Silver’s 
analysis  of  L. 

Silver  was  motivated  by  and  extended  the  groundbreaking  work  of  Gaifman. 
A student  of  Tarski,  Gaifman  wrote  his  1962  Berkeley  dissertation  on  Boolean 
algebras.  Soon  afterwards,  he  had  turned  to  an  altogether  different  subject,  and 
established  a strong  extension  of  Scott’s  result  on  the  incompatibility  of  measurable 
cardinals  and  V — L : 

9.1  Theorem  (Gaifman  [64]).  Suppose  that  there  is  a measurable  cardinal.  Then: 

(a)  There  is  a closed  unbounded  class  C c {a  \ La  < L}  such  that  for 
any  infinite  fi,  there  is  an  a > (5  with  |oj  = |/3|  and  a e C.  This  implies  that 
[P(x)L\  = \x\  for  any  infinite  x e L. 

(b)  There  is  an  a C co  such  that  in  L\a\,  C is  a definable  class  and  (a)  holds. 
This  implies  that  arbitrarily  large  cardinals  in  L are  no  longer  cardinals  in  L[a\ 

H 


Gaifman  achieved  these  results  using  his  method  of  iterated  ultrapowers,  a 
method  to  be  further  exploited  by  Kunen  (§19).  With  (a)  Gaifman  had  indepen- 
dently derived  Rowbottom’s  local  conclusion  8.5(a),  but  moreover  had  shown  for 
the  first  time  that  drastic  global  consequences  hold  for  L in  the  presence  of  a 
measurable  cardinal.  Gaifman  was  not  far  away  from  some  of  Silver’s  results 
on  0#:  0#  is  a subset  of  co  satisfying  (b),  and  it  uniquely  determines  a closed 
unbounded  class  satisfying  (a);  beyond  that,  it  incorporates  a specific  generating 
scheme  for  L.  The  existence  of  0#  was  a genuinely  new  structural  principle,  and 
its  isolation  established  the  intrinsic  necessity  of  large  cardinals  for  transcendence 
over  L. 

The  story  of  0#  begins  in  model  theory.  Investigating  the  problem  of  getting 
models  of  theories  with  a large  number  of  automorphisms,  Andrzej  Ehrenfeucht 
and  Mostowski  developed  the  concept  of  indiscemibility,  and  brought  Ramsey’s 
theorem  7.7  into  model-theoretic  prominence.  For  M a structure  and  X a subset 
of  the  domain  of  M linearly  ordered  by  < (not  necessarily  a relation  of  M), 
{X.  <}  is  a set  of  indiscemibles  for  M iff  for  every  formula  (p(v\, ... , v„ ) in  the 
language  of  M and  < . . . < xn  and  y \ < ...  < yn  all  in  X, 

M |=  cp\x i , . . . , x„]  iff  M\=  <p[y\ , . . . , y„]  . 

That  is,  for  each  n e co  all  increasing  n -tuples  from  X have  the  same  first-order 
properties  in  M.  Variants  on  this  terminology,  e.g.  with  {X,  <}  replaced  by  X 
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when  the  < is  clear,  should  be  unambiguous.  The  following  became  a basic 
ingredient  of  several  important  results: 

9.2  Theorem  (Ehrenfeucht-Mostowski  [56]).  Suppose  that  T is  a theory  with 
infinite  models  and  (A,  <)  is  a linearly  ordered  set.  Then  there  is  a model  Ad  of 
T such  that  X is  included  in  its  domain  and  is  a set  of  indiscernibles  for  Ad. 

Proof.  Expand  the  language  of  T by  introducing  new  constants  cx  for  each  x e X 
and  consider  the  theory  T = TU[cx  f cy  \ x f y both  in  A}  U {<p(cXl , . . . , cxJ  +> 
<p(cy  j , ,cyJ  | <p(  Vi , . . , , vn)  is  a formula  in  the  language  of  T and  x\  < ...  < xn 
and  vi  < . . . < y„  all  in  A}.  It  suffices  to  show  that  T is  consistent,  which  by 
the  Compactness  Theorem  amounts  to  showing  that  every  finite  subset  of  T is 
satisfiable. 

Assume  then  that  S c T is  finite.  Let  A be  an  infinite  model  of  T,  and 
[a,-  | ; e cu}  distinct  members  of  the  domain  of  A.  Let  m be  the  number  of  new 
constants  appearing  among  the  members  of  S,  and  for  k < m define  fk  on  [oof 
by: 


fkih — , h ) = ...,vk)  | 

<p(cXl  ,---,cXk)o  <p(cyi , ...,cyk)  e S A A |=  (p[ah at  J]  . 

Since  S is  finite,  the  range  of  fk  is  finite,  so  applying  Ramsey’s  Theorem  7.7  m 
times  there  is  an  H e [oof0  homogeneous  for  each  fk.  Hence,  A satisfies  S with 
any  m elements  of  [a,-  | i e H)  assigned  to  the  new  constants  appearing  in  S in 
corresponding  increasing  order.  H 

Actually,  Ramsey’s  original  application  [30]  of  the  finite  version  of  his  theo- 
rem was  essentially  to  get  indiscernibles  in  a finite  model  context.  Michael  Morley 
[65, 65a]  drew  attention  to  9.2  by  applying  it  to  derive  important  results  in  model 
theory.  Since  its  proof  proceeds  by  a compactness  argument,  it  gives  little  infor- 
mation about  the  resulting  model.  Silver  pointed  out  that  strong  partition  relations 
directly  provide  indiscernibles  for  structures: 

9.3  Theorem  (Silver  [66,71]).  For  infinite  limit  ordinals  a,  k — > (ot)fw  iff for 
any  structure  Ad  for  a countable  language  with  k a subset  of  its  domain,  there  is 
a set  of  indiscernibles  X e [«•]"  for  Ad. 

Proof.  Let  {<pn  new)  enumerate  the  formulas  of  the  language  so  that  <pn  has 
at  most  the  variables  Vi, ...,  vk(n)  free  where  k(n)  < n.  Define  /:  [/f]<a'  — > 2 
by:  /(?  l.  • • ■ , £„)  = 0 if  Ad  |=  (pn[H\,  ■ ■ ■ , &(„)],  and  /(£  i,  = 1 other- 

wise. Then  any  set  homogeneous  for  / with  ordertype  a limit  ordinal  is  a set  of 
indiscernibles  for  Ad . 

Conversely,  suppose  that  /:  [Ar]<a’  -*  2 and  A is  a set  of  indiscernibles  for 
the  structure  {k,  €,  f\[K~\n)new.  Then  A is  homogeneous  for  /.  H 
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Silver  realized  that  because  of  the  uniformity  of  the  constructive  hierarchy 
L,  if  a sufficiently  rich  structure  had  enough  ordinal  indiscemibles,  the  theory  of 
the  structure  fueled  by  the  class  of  ordinals  in  the  role  of  indiscemibles  can  be 
used  to  generate  L.  This  harkens  back  to  Godel’s  original  impredicative  use  of 
the  class  of  ordinals  to  construct  L by  extending  type  theory,  but  the  indiscernible 
generation  shifts  the  weight  of  the  construction  squarely  on  the  theory  and  leads 
to  striking  consequences  about  the  distance  between  V and  L. 

Silver’s  work  is  now  cast  into  a series  of  lemmata  that  highlight  the  conse- 
quences of  the  various  conditions  leading  to  0#.  An  observation  of  later  signifi- 
cance is  that  these  results  do  not  depend  on  the  Axiom  of  Choice,  and  how  each 
apparent  use  can  be  avoided  is  described  at  its  juncture. 

By  3.3(a)  there  is  a formula  (Pq(vq,  iq)  that  defines  in  L a well-ordering  <L 
of  L such  that:  for  any  limit  8 > a>  and  x,  y e Lg,  x <l  y iff  L$  |=  <po [x , y]. 
For  each  formula  <p(v o, . . . , vm ) of  C€,  define  the  canonical  Skolem  term  t<p  for  <p 
using  <po  as  follows: 

t(p(vu  . . . , vm)  — Vo  iff  (Vi)m+2-,^(um+2,  tq, ...,  vm)  A Do  = 0)  v 

(<p(v 0,  iq Vm ) A Vt)m+i(<po(um+i,  u0)  -*  -•(p(vm+ 1,  tq, l»m)))  . 

For  any  Af  = (M,  E)  satisfying  the  requisite  well-ordering  properties  with  (po 
(e.g.  M is  elementarily  equivalent  to  some  {L$,  e)  with  8 a limit  ordinal  > &>), 
the  corresponding  expansion 

can  be  considered  where  the  well-defined  interpretation  t^4  is  a Skolem  function 
for  (p  such  that  t^4 (x i , . . . , x,„)  is  the  least  y according  to  (po  satisfying  M 
i, o[y , x\, . . . , xm]  when  one  exists.  Note  that  {t^4  \ <p  is  a formula  of  is  already 
closed  under  functional  composition  by  definability,  and  hence  is  a complete  set 
of  Skolem  functions  for  AT  Consequently,  for  X C M the  Skolem  hull  of  X in 
Af  can  be  taken  to  be  well-defined,  with  domain 


{t^{x\, . . . , xm)  | cp  is  a formula  of  and  x\, xm  e A}  . 

This  set  is  canonical:  It  coincides  with  the  collection  of  those  x e M such  that  {x} 
is  definable  over  Af  using  parameters  from  X,  since  that  collection  is  included  in 
any  elementary  substructure  of  Af  that  includes  X. 

By  Skolem  term  is  meant  one  of  the  tf  s in  this  section 

although  their  specific  form  will  not  matter  until  they  are  taken  up  again  in  sub- 
sequent sections. 

Theories  with  indiscemibles  are  considered  next.  Let  £*  be  augmented 
by  constants  {c>  | k e oj}.  By  an  EM  blueprint  (for  Ehrenfeucht-Mostowski)  is 
meant  the  theory  in  £*  of  some  structure 


(T5 , E,  Xk)keo> 
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where  S is  a limit  ordinal  > co  and  {**  I k e co } is  a set  of  ordinal  indiscemibles 
for  {Ls,  e)  indexed  in  increasing  order.  A basic  observation  is  that  for  any  limit 
ordinal  S > co,  any  infinite  set  of  ordinal  indiscemibles  for  (L$,  e)  uniquely  deter- 
mines an  EM  blueprint:  just  take  the  theory  of  (Ls,  £,  xfilkew  for  any  increasing 
subsequence  (xk  \ k e co)  of  the  indiscemibles. 

For  a theory  T in  £*,  let  T~  denote  its  restriction  to  £e,  i.e.  those  sentences 
of  T with  no  occurrence  of  any  cy . 

9.4  Lemma.  Suppose  that  T is  an  EM  blueprint.  Then  for  any  a there  is  a model 
At  — At(T,  a)  of  T~  unique  up  to  isomorphism  such  that: 

(a)  There  is  a set  X of  ordinals  in  the  sense  of  At  of  ordertype  a (under 
At  ’s  ordinal  ordering)  that  constitutes  a set  of  indiscemibles  for  At.  Moreover, 
for  any  formula  cp(v\, . . . , vn)  of  £g  an  increasing  n-tuple  from  X satisfies  <p  in 
At  exactly  when  <p(co, ... , c„_ i)  e T. 

(b)  The  Skolem  hull  of  X in  At  is  again  At. 

Proof.  By  definition  of  EM  blueprint,  T has  at  least  one  infinite  model.  So, 
the  proof  of  9.2  shows  that  there  is  a model  satisfying  (a).  Taking  the  Skolem 
hull  of  the  indiscemibles  in  this  model  results  in  a model  At  that  also  satisfies 
(b).  (Suppose  that  the  usual  Henkin  construction  for  the  Completeness  Theorem 
had  been  used  for  getting  the  model  satisfying  (a),  using  the  language  £“  with  a 
constant  for  each  member  of  a.  Then  the  resulting  “term”  model  already  satisfies 
(b),  and  no  appeal  to  the  Axiom  of  Choice  is  necessary  as  the  terms  of  £“  inherit 
a well-ordering  from  a.) 

Suppose  next  that  there  were  two  such  models,  At  and  At,  with  correspond- 
ing sets  of  indiscemibles  X and  X.  Since  X and  X have  ordertype  a,  let  h:  X — »■  X 
be  the  order  isomorphism.  Then  h extends  to  an  isomorphism  between  At  and 
At:  By  (b),  any  element  of  At  is  of  fomi  tM(x\ , . . . , xn)  for  some  Skolem  term 
t and  xi, . . . , xn  in  X,  and  similarly  for  At  and  X.  Hence,  it  suffices  to  show  that 
the  map  h given  by 

h(tM(x i,  = tM(h(x i) h(x„)) 

is  an  isomorphism:  For  any  formula  <p(v i, . . . , v„ ) and  increasing  n-tuple  (jci,  . . . , 
x„>  drawn  from  X, 

At  |=  (p[x  i, ....  x„\  iff  (p(c0, ...,  c„_i)  e T iff  At  \=  (p[h(x  i), ....  h(xn)]  . 

Skolem  terms  are  definable,  so  using  particular  formulas  asserting  the  equality  of 
Skolem  terms  it  can  be  checked  that  h is  well-defined,  i.e.  does  not  depend  on  the 
term  descriptions,  and  that  it  is  injective.  Similarly,  it  preserves  membership,  and 
clearly  it  is  surjective.  H 

Particularly  portentous  is  when  M{T,a)  is  well-founded;  it  then  has  a tran- 
sitive collapse,  which  must  be  of  the  form  {Ls,  e>  by  3.3(a).  In  that  case, 
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M(T , a)  is  identified  with  ( Ls , e)  . 

By  a standing  convention  {Ls,  e)  is  usually  denoted  by  Ls. 

9.5  Lemma.  Suppose  that  T is  an  EM  blueprint.  Then  A4(T,  a)  is  well-founded 
for  every  a iff 

(I)  M{T,  a)  is  well-founded  for  every  a < co\  . 

Proof.  Assume  that  for  some  a,  M(T,  a)  = (M,  E)  is  ill-founded.  Let 

{a,-  | i e a>)  e C0M  with  aI+i  E a,-  for  every  i e a> . 

(This  does  not  require  the  Axiom  of  Choice  as  ( M , E)  inherits  a well-ordering 
from  a,  and  so  such  a sequence  is  definable  by  recursion.)  Each  a,  is  of  form 
dM'K) (x\ , . . . , xf  for  some  Skolem  term  t and  indiscemibles  x\, . . . , Xj,  so  let  Y 
be  the  countable  set  consisting  of  the  indiscemibles  involved  in  these  terms.  If  A f 
is  the  Skolem  hull  of  Y in  M (T,  a),  then  A f is  ill-founded,  yet  Af  = M(T,  /3) 
where  /l  < a>\  is  the  ordertype  of  Y.  H 

This  result  motivates  the  use  of  a strong  partition  property  to  produce  EM 
blueprints  with  arbitrarily  large  well-founded  models: 

9.6  Lemma.  Suppose  that  there  is  a k satisfying  k — >■  Then  there  is  an 

EM  blueprint  satisfying  (I)  of  9.5. 

Proof.  By  9.3,  L,(  has  an  uncountable  set  of  ordinal  indiscemibles.  Let  T be 
the  corresponding  EM  blueprint.  Then  for  any  a < w\,  M(T,  a)  is  well-founded 
since  it  is  isomorphic  to  the  Skolem  hull  in  LK  of  the  first  a indiscemibles.  H 

(I)  already  yields  Rowbottom’s  conclusion  8.3: 

9.7  Lemma.  If  there  is  an  EM  blueprint  satisfying  (I),  then  V(co)L  is  countable. 

Proof.  M{T,  cof  = Ls  for  some  8 > co\,  so  that  V{co)L  c Ls.  In  particular,  if 
a e V(co)L,  then  a — tLs(x o, ....  x„)  for  some  Skolem  term  t and  indiscemibles 
xo  < ...  < x„.  Let  ( n lieu;}  be  the  increasing  enumeration  of  the  first  a> 
indiscemibles.  Since  each  k e co  is  definable, 

k e tL5(x0, . . . ,xn)  iff  k etLs(z0,...,z„) 

and  hence  a — tLs(zo,  ■ ■ ■ , Zn )•  Since  there  are  only  countably  many  such  forms, 
V(co)L  is  countable.  H 

The  hypothesis  of  9.6  entails  the  existence  of  an  EM  blueprint  satisfying  two 
further  conditions,  from  which  some  remarkable  conclusions  can  be  drawn.  On 
the  basis  of  its  proof,  specify  that 
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(i)  p is  the  least  limit  ordinal  such  that  Lp  has  a set  of  ordinal  indis- 
cemibles  of  ordertype  &>i; 

(ii)  H is  such  a set  of  indiscemibles  with  the  least  possible  &>th  element; 
and 

(iii)  Tq  is  the  corresponding  EM  blueprint. 

9.8  Lemma.  The  following  condition  holds  for  T — Tq: 

(II)  For  any  n- ary  Skolem  term  t,  T contains  the  sentence: 
r(c0, . . . , cn- 1)  e On  ->  r(c0, . . . , c„_i)  < c„  . 

Proof  Assume  to  the  contrary  that 

f (c0. . . . , c„_i)  e On  A c„  < t(c0, ...,  c„_i) 

is  in  To  for  some  t.  Let  zo  < ...  < z„~\  be  the  first  n members  of  our  fixed  set  H of 
indiscemibles  for  Lp,  and  set  H — H — {zo,  . . . , z„- i]  and  <5  — tL"{zo, . . . , zn- i)  < 
p.  By  our  assumption  and  indiscernibility,  H C.  8.  Moreover,  8 can  be  taken  to 
be  a limit  ordinal.  (If  8 — 8 + k where  k e co,  there  are  members  of  H below  8 
and  so  c„  < t(co, , c„_i)  — k holds,  and  hence  H c 8.) 

It  will  now  be  shown  that  H is  a set  of  indiscemibles  for  L$,  contradicting 
the  minimal  choice  of  p:  Suppose  that  x\  < . . . < xm  and  y\  < ...  < ym  are  all 
in  H.  Then  for  any  formula  <p(v i, . . . , vm ), 

(*)  Ts  \—  l)  ■ • • ? -V»i] 

is  equivalent  to  Lp  [=  q>Ls[x\, . . . , xm\,  this  relativization  being  possible  as  Ls  C 
Lp  are  transitive.  Noting  that  Lg  is  definable  in  Lp  from  8 (see  Devlin  [84:  II§2]) 
and  incorporating  the  definition  of  8,  (*)  is  equivalent  to 

Lp  (—  (p[pt\ > • • • > xm , Zo?  • • - j Zn— 1 J 

for  some  f.  Analogously, 

Lg  (=  ^[yi.  • ■ ■ j yin ] iff  Lp  |=  (p[yu  . . . , ym . zo.  • ■ • ) Zn—  l]  ■ 

Hence,  the  indiscernibility  of  H for  Lp  implies  the  indiscernibility  of  H for  L$. 

H 


The  following  characterization  is  immediate: 

9.9  Lemma.  An  EM  blueprint  T satisfies  (II)  of  9.8  iff  for  any  infinite  limit  ordinal 
a,  the  set  of  indiscemibles  corresponding  to  M.{T , a)  is  cofinal  in  the  ordinals  of 
the  structure.  ~\ 

Rowbottom  was  involved  in  this  work,  particularly  in  connection  with  the 
following  further  condition  (Devlin  [73:  195];  Silver  [71:  77]): 
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9.10  Lemma.  The  following  condition  holds  for  T = Tq: 

(III)  For  any  (m  + n + l)-ary  Skolem  term  t,  T contains  the  sentence: 
t(co,  • • • 5 Cm+n)  < cm  > 

t (Co » • • • > Cm+n ) — t (Co  •>  • • • > Cm  — 1,  ^m+M+1,  * - • > ^m+2/i+l)  • 

Remark.  This  is  known  as  the  remarkable  condition.  By  a simple  indiscemibility 
argument,  the  conclusion  can  be  replaced  by  the  self-refinement 

t (To ’ ■ * • , Cm+n)  = t (CQ , ■ • • , Cm— 1,  Qp  • • • , C£M+1) 

for  any  Aq  < . . . < A„+i  with  m < A| . Without  further  comment,  this  is  how  (III) 
will  be  invoked. 

Proof.  It  can  be  assumed  that 


is  in  Tq.  Let  H — [J{j|  | £ < cof  be  a disjoint  partition  into  sets  consist- 
ing of  consecutive  elements  of  H such  that  |sq|  = m,  and  |s,,|  = n + 1 and 
max(s|)  < minis,,)  for  0 < f < r]  < (o\.  Let  t{so,sf)  for  0 < f < a>i  denote 

tLp(x  o xm-\,yl, . . . , yl),  where  s0  = {xo,  ■■■,  *m-i}  and  = {y|, . . . , yf } 

in  increasing  order.  It  suffices  by  indiscemibility  to  derive  a contradiction  from  the 
assumption  that  f (so,  S|)  / t(so,  s,;)  for  some  (and  hence  any)  0 < § < q < a)\. 

If  r(s0,  S|)  > t(so,sn)  for  0 < £ < rj  < co\,  there  would  be  an  infinite 
descending  sequence  of  ordinals.  On  the  other  hand,  if  t(so,s%)  < t(so,stl)  for 

0 < f < r]  < a>i  then  it  is  simple  to  see  that  {f (so,  S|)  | 0 < £ < cof  would  be  a 

set  of  indiscemibles  for  Lp.  However,  the  first  element  yf  of  sw  is  the  &)th  element 
of  H and  t(so,sw)  < y%  by  our  initial  assumption,  contradicting  the  minimality 
of  the  &>th  element  of  H . -\ 

We  now  proceed  to  analyze  those  EM  blueprints  T that  satisfy  conditions 
(I)— (III).  For  such  a T and  for  any  a,  temporarily  let 

“ I H < «) 

denote  the  corresponding  increasing  sequence  of  indiscemibles  for  MiT,  a).  (Ill) 
has  the  following  primary  consequence: 

9.11  Lemma.  IfT  is  an  EM  blueprint  satisfying  (I) — (III)  and  a>  < a < j5  with  a 
a limit  ordinal,  then  the  Skolem  hull  of  {t|  | § < a]  in  M{T,  /3)  is  Llt  where 

1 = Iq-P.  Consequently, 

M(T,  a)  — L,  and  if01  = for  every  § < a . 

Proof.  Let  J\f  be  the  stated  Skolem  hull.  It  suffices  to  show  that  On^  = as 
the  second  sentence  follows  from  the  definition  and  uniqueness  of  A i(T,  a).  For 
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convenience,  the  superscripts  T'P  for  the  indiscemibles  will  be  suppressed  in  what 
follows. 

If  a is  an  ordinal  in  AT,  then  for  some  Skolem  term  t and  £o  < • • • < |„-i  < a, 

a = ^ t|n  i)  < < ia 

by  (II)  and  as  a is  a limit.  Conversely,  if  r < ia,  then 

r = M-M(7’'W(l?0,...,({._1,i,0 (,,)  < la 


for  some  Skolem  term  u and  indiscemibles  in  increasing  order  with 
By  (HI), 


r = UM™(ko,  . . . , l(m_l%l(m_1+ 1,  ■ ■ • , l(m-i+n+  0 - 

and  this  is  in  Af  as  a is  a limit  ordinal. 


< a < rj o- 


H 


Thus,  for  any  EM  blueprint  T satisfying  (I)— (III)  and  any  f , unambiguously 
set 

f = i| for  some  (any)  limit  ordinal  a > § , and 
I T = (if  II  e On}  . 

9.11  has  a host  of  consequences: 

9.12  Lemma.  Suppose  that  T is  an  EM  blueprint  satisfying  (1) — (III).  Then: 

(a)  L,t  -<  Llt  when  | < £. 

(b)  If  I = III  + for  every  £. 

(c)  7r  w a closed  unbounded  class  of  ordinals. 

(d)  For  any  cardinal  X > a>,  f = X e IT  and  so  Ad(T,  A)  = 

(e)  T is  the  only  EM  blueprint  satisfying  (I)— (III). 

Proof,  (a)  for  infinite  limit  ordinals  | < £ follows  directly  from  9.11.  Conse- 
quently, it  holds  for  arbitrary  f < £ by  an  indiscernibility  argument  within  some 
sufficiently  large  Lj  . 

If  a is  an  infinite  limit  ordinal,  then  Lj  is  the  Skolem  hull  in  itself  of 
{t|  | | < a}  by  9.11.  Hence,  |f  | = |a|  and  so  (b)  follows  for  every  f (c)  also 
follows,  since  {f  | | < «}  is  cofinal  in  f by  9.9. 

(d)  is  a consequence  of  (b)  and  (c). 

Finally,  for  (e)  note  that  T is  the  theory  of  ( LWm , e,  co„+  \)n€c0  by  (d).  H 

Assuming  (a)  and  (d),  L is  the  union  of  the  elementary  chain  (L-k  \ X > a>  is 
a cardinal),  so  that  the  satisfaction  relation  for  L can  be  defined  in  ZFC  by: 

L |=  (p[a\,  . . . , a„]  iff  Lk  |=  ip[a\ , . . . , an ] for  some  (any) 

X > co  such  that  a\ , . . . , an  e Lk  . 

In  particular,  a truth  definition  for  L can  be  given  by 
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L\=a  iff  Lm\=cr 

for  sentences  a.  Thus,  the  set  of  (Godel  numbers  of)  sentences  true  in  L is 
constructible.  It  cannot  be  definable  in  L by  the  undefinability  of  truth,  but  of 
course  (the  real)  to\  being  indiscernible  is  not  definable  in  L.  With  the  satisfaction 
relation  for  L in  hand,  assertions  like  the  following  are  directly  formalizable  in 
ZFC. 

9.13  Lemma.  Suppose  that  T is  an  EM  blueprint  satisfying  (I)— (III).  Then: 

(a)  Llt  -<  L for  every  £ . 

(b)  C is  a closed  unbounded  class  of  ordinal  indiscemibles  for  L such  that 
the  Skolem  hull  of  C in  L is  again  L iff  C = IT . 

Proof.  It  is  only  necessary  to  verify  the  forward  direction  of  (b):  Note  that  CD  /r 
is  infinite,  so  that  the  EM  blueprint  corresponding  to  C is  again  T . Let  h:  C — > IT 
be  the  order-preserving  bijection.  It  follows  by  the  uniqueness  argument  for  9.4 
that  h extends  to  an  isomorphism  L.  But  then,  h must  be  the  identity 

(else  it  would  move  some  ordinal,  removing  it  from  the  range).  Hence,  C — IT . 

H 


With  9.12(e)  in  hand,  stipulate  that 

0#  is  that  unique  EM  blueprint  satisfying  (I)— (III) 
if  there  is  one,  acceding  to  the  accepted  solecism 

0#  exists 

for:  There  is  an  EM  blueprint  satisfying  (I)— (III).  Through  a recursive  arithmeti- 
zation  of  £* , 0#  is  regarded  as  a subset  of  to.  While  Silver  had  concentrated  on 
the  theory,  Solovay  [67],  the  source  of  the  # notation,  emphasized  its  construal  as 
a subset  of  to.  The  sense  of  “0#  exists”  will  be  clarified  by  a later  absoluteness 
result  (14.12)  according  to  which  the  set  0#  L,  yet  remarkably  there  is  a formula 
that  defines  0#  in  any  model  of  ZF  containing  it.  Dropping  the  superscript  T in 
9.12  and  9.13  by  uniqueness,  the  following  is  a Hauptsatz  summarizing  Silver’s 
results: 

9.14  Theorem  (Silver  [66,71]). 

(a)  0#  exists  iff  some  L$  has  an  uncountable  set  of  indiscemibles.  Hence,  if 
k — > (toi)^10  for  some  k (e.g.  if  k is  measurable),  then  0#  exists. 

(b)  0#  exists  iff  there  is  a class  I of  ordinals  characterized  by:  I is  a closed 

unbounded  class  of  indiscemibles  for  L such  that  the  Skolem  hull  of  I in  L is 
again  L.  Moreover,  with  (i^  | £ € On)  the  increasing  enumeration  of  I it  has  the 
following  properties:  If  £ < then  Ll%  -<  Ll(  L and  |j||  = |§  | + Ko  so  that  I 
contains  every  uncountable  cardinal,  and  for  any  limit  ordinal  a > to  the  Skolem 
hull  of  [tj  | £ < a } in  Lla  is  again  Lia.  H 
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Thus,  Gaifman’s  9.1  conclusions  have  been  extended  in  an  ultimate  way: 
the  existence  of  0#  is  an  intrinsic  principle  that  provides  a complete  scheme  for 
constructing  L using  one  theory  and  a generating  class  of  ordinal  indiscemibles. 
Silver  also  reduced  Gaifman’s  hypothesis  to  k — > (cdOJ  With  the  next  result 
he  showed  that  this  reduction  is  sharp  in  the  hierarchy  of  Erdos  cardinals: 

9.15  Theorem  (Silver  [66,  70]).  Suppose  that  M is  an  inner  model  of  ZFC, 
a < cof,  and  k — » (a)^®.  Then  ( k — » Hence, 

Con(ZFC  + 3/rVa  < cofK  — » (a)J®)) 


implies 


Con(ZFC  + 3/rVa  < wx{k  — >•  (a) J®)  + 0#  does  not  exist)  . 

Proof.  Suppose  that  /:  \K~\<a>  — > S with  f e M.  A set  in  M of  ordertype  a must 
be  found  homogeneous  for  /.  Since  a < co™ , let  g:  co  a be  a bijection  with 
g e M.  Set  D = {d  \ d is  an  order-preserving  injection:  g“n  ->  k for  some  n, 
whose  range  is  homogeneous  for  /},  and  define  a partial  ordering  < on  D by: 
d ^ d iff  d D d.  Since  g e M and  {k<1°)m  = k<co,  (D,  k)  e M.  It  is 
straightforward  to  see  that  the  following  is  a theorem  of  ZFC: 

is  ill-founded  iff  there  is  a set  of  ordertype  a homogeneous  for  / . 

It  thus  follows  that  is  ill-founded  in  V.  But  then,  is  ill-founded  in  M by 
absoluteness  (0.3),  and  hence  the  above  theorem  used  in  M yields  the  desired 
result. 

The  last  assertion  follows  from  taking  M = L;  by  9.7  (there  is  no  EM 
blueprint  satisfying  (I))1.  H 


Note  that  M need  only  model  one  theorem  of  ZFC  of  sufficient  strength  to 
carry  out  the  foregoing  argument,  and  that  the  superscript  “<&>”  can  be  replaced 
by  “n”  for  any  particular  n e a>. 

There  have  been  further  developments  in  the  study  of  combinatorial  properties 
that  imply  the  existence  of  0#.  Kunen  showed  that  if  there  is  a Jonsson  cardinal, 
then  0#  exists  (21.4).  Devlin-Paris  [73]  showed  how  to  get  0#  from  a combinatorial 
consequence  of  k — (tt>i)J®.  Baumgartner-Galvin  [78]  formulated  a generalized 
version  of  Erdos  cardinals  sensitive  to  the  possible  EM  blueprints  produced,  and 
provided  sharp  implications  about  0#  in  their  framework. 

Finally,  there  is  a straightforward  way  to  get  an  equivalence,  at  the  cost  of 
incorporating  inner  model  hypotheses.  For  an  inner  model  M, 


asserts  that  for  every  function  /:  [i]]<w  —*■  2 such  that  / e M,  there  is  an  X e [?;]“ 
(not  necessarily  in  M)  homogeneous  for  /. 
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9.16  Exercise  (Gloede  [72: 153]).  The  following  are  equivalent: 

(a)  0#  exists. 

(b)  For  every  cardinal  X > a),  X — (X)^' “. 

(c)  For  some  q,  q — a.  H 

If  0#  does  exist,  it  is  not  unexpected  that  there  are  consequences  of  large 
cardinal  character,  especially  about  the  indiscemibles  [t|  | § e On}  themselves. 

9.17  Theorem.  Assume  that  0#  exists.  Then: 

(a)  There  is  an  elementary  embedding:  L < L. 

(b)  (t|  is  totally  indescribable) L for  every  f. 

(c)  [P{x)L\  = \x\  for  every  infinite  x e L. 

(d)  -»  (a)2<u)L  for  ever}’  f and  a < u>\. 

Proof,  (a)  Any  order-preserving  injection  h from  [i||  § e On}  into  itself  extends 
uniquely  to  an  h:  L < L defined  by: 

(cf.  the  proof  of  9.4).  Any  such  h which  is  not  the  identity  thus  induces  an 
elementary  embedding  that  moves  some  ordinal. 

(b)  By  indiscemibility,  it  suffices  to  show  that  to  is  totally  indescribable  in 

L.  Let  h be  any  map  as  above  such  that  h(i0)  > i0.  Then  i0  is  the  critical  point 
of  h:  L < L:  If  S — tL{ify  ....  i|n)  < to  with  then 

m = tLm fl), . . . , *(»£.))  = fL(4 1, — *&)  = 5 

by  (III).  The  argument  for  6.6  now  shows  that  to  must  be  totally  indescribable  in 
L. 

(c)  By  (b),  the  (real)  cardinal  |x|+  is  inaccessible  in  L. 

(d)  It  suffices  by  indiscemibility  to  establish  the  property  assuming  that  ij 

is  a cardinal  X > a>.  Suppose  then  that  /:  [A]<tt>  -»  2 and  f e L.  Then 
/ = tL(tf0, . . . , t|m_,,  tfo,  . . . , tfn)  say,  with  the  indiscemibles  in  increasing  or- 
der and  tfm_ j < X — ix  < tfo.  By  an  indiscemibility  argument  it  follows  that 
[t|m_1+i+^  | fl  e a]  is  homogeneous  for  /.  Hence  by  9.15,  there  is  a homoge- 
neous set  in  L of  ordertype  a.  H 

Digressing  briefly,  a proof  of  7.16  is  sketched  as  previously  promised: 

9.18  Exercise  (Reinhardt-Silver  [65]).  There  is  a totally  indescribable  cardinal 
below  k(co). 

Flint.  Let  k = W a well-ordering  of  VK,  and  Iq  a set  of  indiscemibles  of 

ordertype  a>  for  (VK,G,W).  Let  Fl  < VK  be  the  Skolem  hull  of  Iq  in  VK  with 
respect  to  Skolem  functions  defined  with  W,  Hq  the  transitive  collapse  of  H,  and  e 
the  inverse  of  the  collapsing  isomorphism.  Note  that  Hq.  H,  and  VK  are  all  models 
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of  ZFC  since  k is  inaccessible.  As  in  the  proof  of  9.17(b),  any  order-preserving 
injection  h:  Iq  ->  Iq  which  is  not  the  identity  induces  an  elementary  embedding 
h:  Hq  -<  Hq  whose  critical  point  S satisfies  Hq  |=  8 is  totally  indescribable.  But 
then,  e(8)  is  totally  indescribable,  since  it  is  so  in  H and  hence  in  VK.  H 

Sharps 

The  formulation  of  0#  for  L can  be  relativized  in  a straightforward  way  to  inner 
models  L[a ] (§3)  for  sets  a C On  to  produce  corresponding  sets  a#  c On.  Hence, 
# can  be  considered  as  an  operation  on  sets  of  ordinals,  with  0#  for  L = L[ 0]. 
Suppose  that  a is  a set  of  ordinals,  say  with  a — Ua,  which  can  be  taken  to  be 
infinite  to  avoid  trivialities.  An  EM  blueprint  for  a is  the  theory  of  some  structure 

(L5[a],  e,xk,a,%)k€l0^  <a 

where  8 > a is  a limit  ordinal  and  [xk  \ k e &>}  is  a set  of  ordinal  indiscemibles 
for  (Lj[a],  a indexed  in  increasing  order.  To  get  an  EM  blueprint  for 

a satisfying  (the  analogues  of)  the  conditions  (I)— (III),  uncountably  many  ordinal 
indiscemibles  are  needed  for  a structure  M for  a language  with  |a|  non-logical 
symbols.  For  this  purpose,  instead  of  the  special  argument  of  9.3,  if  {(p^  \ f < |a|} 
enumerates  the  formulas  of  the  language  with  (p f having  at  most  the  variables 
Vi,  . . . , vktf)  free,  the  function 

/(It,  ...,£.)  = R < |a|  | k{0  < n A M \=  — »:?*(?)]} 

can  be  used.  Consequently,  a cardinal  k satisfying  the  partition  property 

k — > (<wi)^® 

suffices.  All  further  details  go  through  with  little  change  to  procure  a unique 
EM  blueprint  a#  satisfying  (I)— (III)  which  through  coding  can  be  regarded  as  a 
subset  of  \oi\,  and  a corresponding  closed  unbounded  class  of  ordinal  indiscemibles 
Ia  for  (L[a],  e,a,f)£<„.  a#  for  a c a>  hold  the  main  interest  because  of  the 
intrinsic  importance  of  the  reals,  construed  here  as  subsets  of  a>.  (From  14.16  on, 
a shift  is  made  to  a#  for  reals  a construed  as  members  of  wco.)  No  excursion  into 
uncountable  languages  is  then  necessary: 

9.19  Theorem.  If  k — > (o>]  )f"  for  some  k,  then  a # exists  for  every  a c a>. 

-\ 


The  isolation  of  0#  was  the  culmination  of  a flow  of  ideas  from  early  efforts 
in  the  partition  calculus,  through  the  infusion  of  model-theoretic  techniques,  to 
the  investigation  of  L.  After  Silver’s  results  on  0#  together  with  Solovay’s  on  its 
definability  (§14)  established  its  intrinsic  importance,  further  results  provided  new 
structural  insights.  Kunen  soon  showed  (21.1)  that  the  converse  of  9.17(a)  holds: 
If  there  is  an  elementary  embedding  j:  L -<  L,  then  0#  exists . This  result  provided 
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a simple  paradigm  for  transcendence  over  inner  models  M,  namely  the  existence 
of  an  elementary  embedding:  M < M.  Then  in  1974,  Jensen  established  his 
celebrated  Covering  Theorem  (see  volume  II),  easily  the  most  important  result  of 
the  1970’s  in  set  theory:  0#  does  not  exist  iff for  any  uncountable  set  X C On, 
there  is  a Y e L such  that  Y I)  X and  |T|  = |X|.  Not  only  does  this  result 
provide  a surprisingly  weak  condition  for  the  existence  of  0#,  but  it  implies  in 
the  absence  of  0#  that  some  of  the  regular  behavior  of  cofinalities  in  L lifts  to 
V.  These  results  have  buttressed  the  existence  of  0#  as  the  focal  hypothesis  of 
transcendence  over  L. 


Chapter  3 

Forcing  and  Sets  of  Reals 


This  chapter  describes  the  first  advances  using  Cohen’s  method  of  forcing  that 
involved  large  cardinals  and  the  first  applications  of  large  cardinals  in  descriptive 
set  theory.  Cohen’s  creation  transformed  set  theory,  and  large  cardinal  hypotheses 
played  an  increasingly  prominent  role  as  a consequence.  §10  discusses  the  devel- 
opment of  forcing,  reviews  the  basic  theory,  and  then  focuses  on  mild  extensions 
and  the  Levy  collapse.  §11  is  devoted  to  Solovay’s  inspiring  result  that  if  there 
is  an  inaccessible  cardinal,  then  in  an  inner  model  of  a forcing  extension,  every 
set  of  reals  is  Lebesgue  measurable.  §12  reviews  the  historical  development  of 
descriptive  set  theory  and  establish  a working  context,  one  in  which  the  classical 
results  are  established  in  §13  through  to  a delimitation  established  by  Gddel  with 
L.  This  sets  the  stage  for  the  further  results  about  large  cardinals  and  projective 
sets,  a major  direction  of  set-theoretic  research  from  the  mid-1960’s  onwards. 
§14  describes  Solovay’s  germinal  work  on  sets  that  grew  out  of  his  Lebesgue 
measurability  result,  and  his  results  and  conjectures  on  the  definability  of  sharps. 
Then  §15  describe  how  Martin  used  sharps  to  extend  the  methods  of  classical 
descriptive  set  theory  to  the  analysis  of  £3  sets. 
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10.  Development  of  Forcing 

In  April  of  1963  the  analyst  Paul  Cohen  of  Stanford  University  circulated  notes 
sketching  proofs  of  the  independence  of  the  Axiom  of  Choice  from  ZF  and  of 
the  Continuum  Hypothesis  from  ZFC.  These  proofs,  of  course,  were  the  inau- 
gural examples  of  his  technique  of  forcing  for  extending  models  of  set  theory. 
Cohen  lectured  on  his  results  on  3 May  1963  at  the  Institute  for  Advanced 
Study,  and  then  two  articles  [63,64]  on  the  CH  result  were  quickly  communi- 
cated by  Godel  to  the  Proceedings  of  the  National  Academy  of  Sciences  U.S.A., 
where  his  own  consistency  results  with  L had  appeared  a quarter  of  a century 
before. 

Stirred  by  clear  indications  of  a general  procedure  in  Cohen’s  breakthrough, 
logicians  close  at  hand  soon  developed  the  theory  of  forcing  and  began  to  es- 
tablish further  results  with  it.  By  mid- 1963  Solomon  Feferman  at  Stanford  had 
results  in  first-  and  second-order  arithmetic  and  on  the  Axiom  of  Choice,  and  ar- 
riving there  in  the  summer  Levy  quickly  obtained  further  results  elaborating  AC. 
Then  two  results  displaying  a notable  sophistication  were  established  at  Princeton 
University:  William  Easton’s  [64,70]  in  late  1963  on  powers  of  regular  cardi- 
nals by  class  forcing,  and  Solovay’s  [65b,  70]  in  mid-1964  that  if  there  is  an 
inaccessible  cardinal,  then  in  an  inner  model  of  a forcing  extension,  every  set  of 
reals  is  Lebesgue  measurable.  This  preparatory  section  discusses  the  development 
of  the  forcing  method  and  sets  the  stage  for  its  interplay  with  large  cardinals, 
especially  toward  Solovay’s  result.  See  Moore  [88]  for  more  on  the  origins  of 
forcing. 

Forcing  provided  a remarkably  general  and  flexible  scheme  with  strong  in- 
tuitive underpinnings  for  establishing  relative  consistency  and  independence.  Ac- 
cording to  Scott  (Bell  [85:  ix]):  “Set  theory  could  never  be  the  same  after  Cohen, 
and  there  is  simply  no  comparison  whatsoever  in  the  sophistication  of  our  knowl- 
edge about  models  of  set  theory  today  as  contrasted  to  the  pre-Cohen  era.”  In 
retrospect  one  can  point  to  precursors  of  forcing  in  the  Beth  semantics  for  intu- 
itionistic  logic  (see  Dummett  [77:  §5]);  in  the  study  of  2-valued  homomorphisms 
of  Boolean- valued  models  (see  Rasiowa-Sikorski  [63:  VIII]);  in  Stephen  Kleene 
and  Emil  Post’s  argument  for  producing  incomparable  Turing  degrees  (see  Ler- 
man  [83:  II§2]);  and  especially  in  Clifford  Spector’s  two-quantifier  argument  for 
producing  minimal  Turing  degrees  (see  Lerman  [83:  V§2]).  However,  Cohen  was 
unaware  of  these  adumbrations  and  started  with  simple  and  basic  intuitions.  His 
particular  achievement  lies  in  devising  a concrete  procedure  for  extending  models 
of  set  theory  in  a minimal  fashion  without  altering  the  ordinals.  Scott  continued 
(Bell  [85:  ix]): 

I knew  almost  all  the  set-theoreticians  of  the  day,  and  I think  I can  say  that  no  one  could 
have  guessed  that  the  proof  would  have  gone  in  just  this  way.  Model-theoretic  methods 
had  shown  us  how  many  non-standard  models  there  were;  but  Cohen,  starting  from  very 
primitive  first  principles,  found  the  way  to  keep  the  models  standard  (that  is,  with  a 
well-ordered  collection  of  ordinals). 
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Set  theory  had  undergone  a sea-change,  and  beyond  how  the  subject  was  enriched, 
it  is  difficult  to  convey  the  strangeness  of  it. 

Beyond  his  initial  articles  [63, 64]  Cohen  provided  a further  account  [65]  and 
then  a full  exposition  in  the  monograph  [66]  based  on  his  lectures  at  Harvard 
University  in  the  spring  of  1965.  Later,  he  expressed  his  own  philosophical  views 
on  the  foundations  of  set  theory  in  a contribution  [71]  to  the  Axiomatic  Set  Theory 
conference  held  during  the  summer  of  1967  at  the  University  of  California  at  Los 
Angeles  (U.C.L.A.).  By  all  accounts  this  was  one  of  those  rare,  highly  exhilarating 
conferences  that  both  summarized  the  progress  and  focused  the  energy  of  a new 
field  opening  up. 

Cohen  first  discussed  and  compared  the  realist  and  formalist  positions  in 
mathematics.  He  considered  ([71: 1 Iff])  that  as  regards  the  existence  of  a measur- 
able cardinal  “there  seems  absolutely  no  intuitively  convincing  evidence  for  either 
rejection  or  acceptance.”  Concerning  CH  he  reckoned  that  it  may  well  come  to 
be  accepted  as  false  ([71: 12]):  “The  justification  might  be  that  the  continuum, 
which  is  given  to  us  by  the  power  set  construction,  is  not  accessible  by  any  pro- 
cess which  attempts  to  build  up  cardinals  from  below  by  means  of  a construction 
based  on  the  Replacement  Axiom.  Thus  C [i.e.  2K°]  would  be  considered  greater 
than  Ki,  K„,  etc.”  Presumably  then,  2K°  would  have  to  be  weakly  inaccessi- 
ble. Cohen’s  speculation  here  was  from  a realist  perspective,  one  advocated  more 
directly  in  [66:  151]. 

Acknowledging  the  influence  of  Robinson  [65],  Cohen  [71: 13]  then  came 
down  on  the  formalist  side.  He  continued  (p.  14): 

Another  example  of  how  repeated  exposure  dulls  the  critical  faculties  is  the  Axiom  of 
Inaccessible  Cardinals.  The  usual  justification  given  for  assuming  it  is  a negative  one, 
namely  that  it  is  unreasonable  to  think  that  every  set  is  accessible.  The  analogy  is  drawn 
with  the  passage  from  finite  to  infinite  sets.  If  one  performs  by  induction  a transfinite 
sequence  of  suitable  closure  operations,  it  is  argued,  we  can  still  go  further  and  presumably 
find  an  inaccessible  cardinal  beyond.  However,  I feel  that  this  is  a specious  argument, 
since  it  tends  rather  to  justify  the  existence  of  a standard  model  for  set  theory,  which  is  an 
incomparably  weaker  assertion.  A more  honest  reason  for  accepting  inaccessible  cardinals 
is  that  experience  has  shown  that  they  do  not  lead  to  contradictions  and  we  have  developed 
some  kind  of  intuition  that  no  such  contradiction  exists. 

This  point  of  view  stands  in  distinct  opposition  to  those  of  Zermelo  [30](§  1)  and 
Gddel  [47](§4). 

Cohen’s  concluding  remarks  notably  anticipate  later  progress  in  large  cardi- 
nals, but  are  tempered  by  an  undercurrent  of  doubt  and  uncertainty: 

Yet  mathematics  may  be  likened  to  a Promethean  labor,  full  of  life,  energy  and  great 
wonder,  yet  containing  the  seed  of  an  overwhelming  self-doubt.  It  is  good  that  only 
rarely  do  we  pause  to  review  the  situation  and  to  set  down  our  thoughts  on  these  deepest 
questions.  During  the  rest  of  our  mathematical  lives  we  watch  and  perhaps  partake  in 
the  glorious  procession.  Great  questions  of  set  theory  that  seemed  untouchable  eventually 
yield.  New  axioms  are  investigated  and  larger  and  larger  cardinals  are  somehow  brought 
closer  to  our  intuition.  Through  all  this,  number  theory  stands  as  a shining  beacon.  If,  as 
I hope  does  not  happen  too  often,  our  doubts  begin  to  overpower  us,  we  retreat  back  into 
the  safe  confines  of  number  theory  until  refreshed,  we  venture  out  again  into  the  unsafe 
ground  of  set  theory.  This  is  our  fate,  to  live  with  doubts,  to  pursue  a subject  whose 
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absoluteness  we  are  not  certain  of,  in  short  to  realize  that  the  only  “true”  science  is  itself 
of  the  same  mortal,  perhaps  empirical,  nature  as  all  other  human  undertakings. 

It  is  notable  that  in  comments  made  two  decades  later,  Cohen  was  to  stress 
the  formal  solution  of  the  Continuum  Problem,  without  further  speculating  on  the 
falsity  of  CH  (Albers  et  al.  [90:  54]): 

Certainly  Godel  himself  had  the  platonic  view  that  the  question  demanded  an  absolute 
answer  and  that,  therefore,  neither  his  proof  of  the  consistency  of  the  continuum  hypothesis 
with  the  axioms  of  set  theory  nor  mine  of  its  independence  from  them  was  the  final  answer. 

My  personal  view  is  that  I regard  the  present  solution  of  the  problem  as  very  satisfactory. 

I think  that  it  is  the  only  possible  solution.  It  gives  one  a feeling  for  what’s  possible  and 
what’s  impossible,  and  in  that  sense  I feel  that  one  should  be  very  satisfied... There  will  be 
philosophical  papers,  but  I don’t  think  any  mathematical  paper  will  say  that  there  is  any 
answer  other  than  the  answer  that  it’s  undecidable. 

Forcing  was  quickly  forged  into  a general  method,  particularly  by  the  efforts 
of  Solovay.  He  above  all  epitomized  this  period  of  great  expansion  in  set  the- 
ory, with  his  mathematical  sophistication  and  fundamental  results  with  forcing, 
in  large  cardinals,  and  in  descriptive  set  theory.  Following  initial  graduate  study 
in  differential  topology,  Solovay  focused  his  energies  on  set  theory  after  attend- 
ing Cohen’s  3 May  1963  lecture.  He  first  extended  the  independence  of  CH  by 
characterizing  the  possibilities  for  the  size  of  the  continuum  ([63,65]),  and  then 
generalized  forcing  to  arbitrary  partial  orders  and  dense  sets.  He  next  established 
his  Lebesgue  measurability  result  during  March-July  1964  ([70:  1]).  Then  with 
Stanley  Tennenbaum  he  worked  out  the  iterated  forcing  proof  of  the  consistency 
of  the  classical  Suslin  Hypothesis  in  June  1965  (Solovay-Tennenbaum  [71:201]). 

Complications  in  describing  when  a formula  holds  over  a range  of  generic 
extensions  soon  led  Solovay  to  the  idea  of  a Boolean  value  for  the  formula.  It 
turned  out  that  Petr  Vopenka  had  developed  a similar  concept  in  a reworking  [64] 
of  the  independence  of  CH.  (Notably,  this  paper  contains  the  remark  that  any 
inaccessible  k can  be  made  weakly  but  not  strongly  inaccessible  by  rendering 
2K,i  > k and  preserving  cardinals.)  The  conception  was  generalized  and  simplified 
in  a series  of  papers  on  the  so-called  V-models  from  the  active  Prague  seminar 
founded  by  Vopenka  (see  Hajek  [71:  78]),  culminating  in  the  exposition  Vopenka 
[67].  However,  the  earlier  papers  did  not  have  much  impact,  partly  because 
of  an  involved  formalism  in  which  formulas  were  valued  in  a complete  lattice 
rather  than  Boolean  algebra.  Working  independently,  Scott  and  Solovay  developed 
the  idea  of  recasting  forcing  entirely  in  terms  of  Boolean-valued  models,  and 
came  up  with  a formulation  essentially  equivalent  to  Vopenka’s.  The  method  is 
completely  general,  since  any  partial  order  can  be  “completed”  in  a natural  way 
to  get  a complete  Boolean  algebra.  Scott  popularized  this  approach  in  his  own 
reworking  [67]  of  the  independence  of  CH  and  in  lecture  notes  distributed  at  the 
1967  U.C.L.A.  conference. 

Boolean-valued  models  first  showed  how  to  avoid  Cohen’s  ramified  language 
connection  with  the  L hierarchy  and  his  dependence  on  a countable  e-model.  With 
their  elegant  algebraic  trappings  and  seemingly  more  complete  information,  they 
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held  the  promise  of  being  the  right  approach  to  independence  results.  J.  Barkley 
Rosser  wrote  a book  on  the  subject  entitled  Simplified  Independence  Proofs  [69], 
and  the  initial  iterated  forcing  paper  Solovay-Tennenbaum  [71]  was  written  in 
terms  of  Boolean-valued  models.  But  already  at  U.C.L.A.,  the  lucid  lectures 
of  Joseph  Shoenfield  [71]  had  demonstrated  that  forcing  with  partial  orders  can 
accommodate  the  gist  of  the  Boolean  approach  in  a straightforward  manner.  More- 
over, Boolean-valued  models  were  soon  found  to  be  too  abstract  and  unintuitive 
for  establishing  new  consistency  results,  so  that  within  a few  years  set  theorists 
were  generally  working  with  partially  orders.  It  is  a testament  to  Cohen’s  concrete 
approach  that  in  this  return  from  abstraction,  even  the  use  of  ramified  languages 
has  played  an  essential  role  in  careful  forcing  arguments  at  the  interface  of  recur- 
sion theory  and  set  theory. 

During  this  period  new  set-theoretic  results  were  being  established  at  a bewil- 
deringly  fast  pace,  in  the  first  wave  of  applications  of  forcing  and  the  coordinated 
elaboration  of  large  cardinals.  A timely  survey  by  Adrian  Mathias,  circulated  in 
typescript  soon  after  the  U.C.L.A.  conference  and  eventually  appearing  as  Mathias 

[79] ,  played  an  influential  role  in  communicating  and  systematizing  these  devel- 
opments. In  particular,  it  was  a vital  source  for  graduate  students  at  Berkeley  who 
were  later  to  become  prominent  set  theorists,  like  James  Baumgartner,  Richard 
Laver,  and  William  Mitchell. 

Forcing  Preliminaries 

The  basics  about  forcing  through  two-step  iterations  as  presented  in  the  texts  Kunen 

[80] ,  Jech  [78],  or  Bell  [85]  are  assumed.  By  and  large,  the  rather  standard  partial 
order  approach  and  terminology  of  the  first  text  is  adopted;  the  Boolean-valued 
approach  is  integrated  into  the  second  and  prominent  in  the  third.  Anticipating  the 
wide-ranging  use  of  forcing  in  volume  II,  the  salient  and  sometimes  distinctive 
features  are  described  in  what  follows: 

For  forcing  purposes,  a partial  order  (p.o.)  is  a partially  ordered  set  (P,  <) 
that  has  a maximum  element  denoted  by  11,  and  for  p,q  e P (the  conditions) 

p < q iff  p extends  or  refines  q , 

i.e.  less  (in  <)  is  more  (informative).  This  conforms  to  the  Boolean-valued  ap- 
proach and  suggests  that  p allows  fewer  models  than  q.  For  simplicity  ( P , <)  is 
usually  denoted  by  P,  but  if  there  is  a need  to  be  specific,  <p  is  written  for  < 
and  Up  for  11. 

It  is  not  necessary  to  assume  that  < is  antisymmetric  (see  Kunen  [80:  52ff]) 
and  in  iterated  forcing  arguments  the  naturally  defined  orders  are  not.  However, 
for  convenience  in  such  situations 

p and  q are  identified  whenever  p < q and  q < P 

(so  that  Ip  is  the  maximum  element),  effectively  passing  to  a p.o.  of  equivalence 
classes,  but  avoiding  awkward  notation. 
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Set: 


p ||  q iff  p and  q are  compatible,  i.e.  3 r(r  < p A r < q)  . 
p _L  q iff  p and  q are  incompatible,  i.e.  —<(p  ||  q)  . 


For  D C P, 


D is  dense  (in  P) 
D is  predense  (in  P) 
D is  dense  open  (in  P) 
D is  an  antichain  (in  P) 
D is  an  antichain  below  p 


iff  V p e P3d  e Did  < p)  . 
iff  ip  e P3d  e Did  ||  p)  . 

iff  D is  dense  A V< d e Dip  e Pip  < d -»  p e D)  . 
iff  Vp,  e Dip  ± q ->  p±q). 
iff  D is  an  antichain  in  {q  \ q < p}  . 


If  D is  either  dense  or  a maximal  antichain,  then  it  is  predense. 

Suppose  that  M is  a transitive  e-model  of  ZFC.  Then  for  any  set  x, 

M[x]  is  the  smallest  transitive  e-model  of  ZFC  including  M U {a  }, 

assuming  one  exists.  For  a p.o.  P e M and  G c P, 

G is  P-generic  over  M iff 

(!)  'ip,  q e G3/-  e G(r  < p a r < q)  , 

(ii)  ip  e Giq  e Pip  < q ->  q e G)  , and 

(iii)  iD  c P (D  dense  inP  A /)  e M ->  GnD/0). 


Here,  “dense”  can  be  replaced  by  “predense”.  For  such  G,  it  is  simple  to  see  that 
for  any  maximal  antichain  A c P with  A e M,  A n G has  exactly  one  element; 
more  generally,  if  p e G and  A is  a maximal  antichain  below  p with  A e M, 
then  A n G has  exactly  one  element.  In  forcing  arguments,  M is  the  ground  model 
and  M[G ] the  generic  extension.  Trivial  p.o.’s  leading  to  M = M[G\  are  allowed 
to  facilitate  the  statements  and  proofs  of  theorems. 

No  particular  formalization  of  the  forcing  language  is  adopted,  but  the  fol- 
lowing are  specified  about  the  names  (forcing  terms):  Names  (or  P-names  if  there 
is  a need  to  be  specific)  are  denoted  by  dotted  letters,  like  i,  and  their  interpreta- 
tions in  a generic  extension  M[G]  by  iG  or  This  approach  is  taken  with 

terms  definable  in  the  language  of  set  theory,  e.g. 

d>2  is  a name  for  the  second  uncountable  cardinal 

in  the  generic  extension.  A basic  observation  is  that  a set  x belongs  to  M[G] 
iff  x is  definable  in  M[G\  from  G and  finitely  many  elements  of  M,  and  hence 
corresponds  to  a name.  If  x e M,  then  x denotes  the  canonical  name  for  x,  so 
that  xG  = x for  every  P-generic  G over  M.  As  is  customary,  a is  written  instead 
of  a for  ordinals  a e M,  since  ordinals  are  absolute  for  all  generic  extensions.  G 
is  the  canonical  name  for  the  generic  object,  and  V the  canonical  name  for  the 
ground  model.  By  this  means,  the  ground  model  can  be  considered  a (definable) 
class,  and  hence  an  inner  model,  in  any  generic  extension. 

The  forcing  relation  for  P is  denoted  by 
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P Ibp  V , 

and  Ibp  <P  iff  Ip  Ibp  V , and 

p ||p  (p  iff  p decides  <p,  i.e.  p ||-/>  <p  or  p ||-p  —><p  . 

The  subscript  P is  suppressed  when  clear  from  the  context,  which  should  also 
differentiate  from  ||  used  for  compatibility  of  conditions.  Specific  formulas  <p  are 
rendered  in  mathematical  English,  only  set  apart  by  the  Quinean  quotes  r 1 when  a 
misreading  is  possible.  |b  is  the  standard,  deductively  closed  relation,  historically 
called  “weak  forcing”  and  differing  from  Cohen’s  original  relation  (see  Kunen 
[80: 235ff] ).  Among  its  well-known  properties  is  the  Maximal  Principle'. 

If  p |b  3 x<p(x,  i\, , i„),  then  for  some  name  t,  p ||-  (p{i,  i\, ...  ,in)  . 

This  requires  the  Axiom  of  Choice  in  the  ground  model  and  is  crucial  in  several 
arguments.  Finally,  the  canonical  names  G for  the  generic  object  and  V for  the 
ground  model  satisfy  the  following  for  any  p.q  e P and  name  t,  in  conformance 
with  the  recursive  definition  of  the  forcing  relation  for  atomic  formulas: 

p \[  q eG  iff  Vr  < p3s  < r(s  < q)  . 

p |b  i e V iff  Vr  < p3s  < r3x(s  |[-  x = i)  . 

Forcing  consistency  results  are  usually  stated  syntactically,  in  terms  of  the 
existence  of  a p.o.  P so  that  for  the  desired  assertion  <p,  ||-/>  <p.  This  succinct 
formulation  makes  the  closest  approach  to  the  corresponding  Unitary  relative  con- 
sistency result,  as  |b/>  o for  any  ZFC  axiom  a and  |b/>  is  deductively  closed. 
However,  as  is  the  usual  practice  proofs  are  often  given  semantically  as  foreshad- 
owed by  the  discussion  of  transitive  e-models,  and  corresponding  to  the  syntactic 
formulations  V is  regarded  as  the  ground  model  and  generic  extensions  V[G]  are 
“taken”.  With  this  in  mind, 

P-generic  means  P-generic  over  V. 

This  of  course  runs  counter  to  the  Cantor-Gddel  view  of  the  class  V — [x  \ x = x] 
as  the  universe  of  all  sets,  and  the  travesty  is  often  compounded  by  talking  of 
extensions  of  V[G ].  However,  it  is  well-known  that  this  semantic  approach  can 
be  regarded  as  merely  ime fagon  de parler  for  formalizable  arguments  about  forcing 
relations,  and  it  not  only  taps  a vital  source  of  intuitions  about  forcing  but  is  often 
the  only  way  to  render  arguments  intelligible. 

An  important  feature  of  the  theory  is  that  generic  extensions  can  be  identified 
through  embeddings  of  the  corresponding  p.o.’s:  For  p.o.’s  P and  Q, 
i:  P — > Q is  a complete  embedding  iff 

(i)  Vp,  q e P(p  <q->  i (p)  < /(<?))  , 

(ii)  V p,  q e P(p  _L  q — > i(p)  _L  i(q))  , and 

(iii)  if  A is  a maximal  antichain  in  P , 
then  i“A  is  a maximal  antichain  in  Q . 

i : P ->  Q is  a dense  embedding  iff  i satisfies  (i)  and  (ii)  above, 
and  i“P  is  dense  in  Q . 


120 


Chapter  3.  Forcing  and  Sets  of  Reals 


These  definitions  do  not  require  i to  be  injective.  A dense  embedding  is  easily 
seen  to  be  complete. 

p=  Q 

asserts  there  is  a a p.o.  isomorphism:  P ->  Q.  Kunen  [80: 2 1 8ff]  has  an  equivalent 
definition  of  “complete”  and  a proof  of  the  following  proposition.  Toward  its 
statement,  for  X c P set 

fX)  = {q  e Q | 3peX(i(p)<q)}  . 

10.1  Proposition.  Suppose  that  i:  P — > Q is  complete. 

(a)  If  H is  Q-generic,  then  i~1(H)  is  P -generic  and  so  V [i  “ 1 (//)  ] C V[H\. 
If  i is  dense,  then  i(i~1(H))  = H and  so  V\i~l(H)\  — V[H\. 

(b)  If  i is  dense  and  G is  P-generic,  then  i{G)  is  Q-generic,  /_1(i(G))  = G, 

and  so  V[/(G)]  = V\G\  H 


The  arguments  are  straightforward  and  the  latter  assertions  follow  from  i e 
V.  Generic  extensions  are  often  identified  using  this  result  when  P is  actually 
isomorphic  to  a dense  subset  of  Q,  but  the  basic  process  of  passing  to  a separative 
quotient  (see  Jech  [03:204ff])  uses  the  weaker  hypotheses: 

For  a p.o.  P, 

P is  separative  iff  ip,  q e P(p  f q ->  3 r < p(r  _L  q))  . 

By  and  large,  p.o.’s  that  occur  in  practice  are  separative,  and  the  incentive  behind 
the  concept  is  to  insure  a certain  orderliness:  separative  p.o.’s  are  isomorphic  to 
a dense  subset  of  their  corresponding  complete  Boolean  algebras,  and  are  charac- 
terized by  a simple  but  useful  observation: 

10.2  Lemma.  A p.o.  P is  separative  iff  'ip,  q e P(p  < q p ||-  q e G).  H 


For  a p.o.  P and  p , q e P, 

P^q  iff  Vr  e P(r  ||  p r ||  q)  , 


and 


[p]  = {q  I q ^ p\  ■ 

~ is  an  equivalence  relation  on  P,  and  [p]  the  corresponding  equivalence  class 
of  p. 

The  separative  quotient  of  P is  P/&  = {[p]  | p e P] 

ordered  by 


[p]  < W iff  Vr  e P(r  < p ->  r ||  q)  . 


By  the  following  simple  observation  separative  p.o.’s  suffice  to  get  all  generic 
extensions: 
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10.3  Lemma.  The  map  sending  p to  [p]  is  a dense  embedding  of  P onto  P/&. 

H 


Familiarity  is  assumed  with  chain,  closure,  and  density  (distributivity)  con- 
ditions on  p.o.’s  and  their  salutary  effects  on  generic  extensions,  but  because  of 
some  variance  in  usage  definitions  are  given:  For  a p.o.  P, 

P has  the  K-chain  condition  (k- c.c.)  iff 

for  any  antichain  A in  P,  |A|  < AT  . 

P is  k -closed  iff  whenever  y < k and  {pu  | a < y}  C.  P 
with  pp  < pa  for  a < f < y,  there  is  a 
p e P such  that  p < pa  for  every  a < y . 

P is  K-Baire  iff  whenever  y < k and  Da  for  each  a < y 
is  dense  open,  so  is  f\<r  A*  ■ 

These  are  all  “less  than  k”  notions;  v+-Baire  is  also  called  k -distributive  because 
of  the  connection  with  distributivity  laws  in  Boolean  algebras.  Familiarity  is 
assumed  with  how  the  k- c.c  is  often  deduced  from  a combinatorial  fact  known  as 
the  A-system  Lemma  (see  Kunen  [80: 49ff]): 

10.4  Proposition  (Sanin  [46]  for  X — co).  Suppose  that  X < k,  k is  regular,  and 
a < k implies  that  a<x  < k.  If  \ A | — k and  x e A implies  that  \x\  < X,  then 
there  is  a B C A with  \B\  = k such  that  B forms  a A-system,  i.e. 

there  is  an  r such  that  for  distinct  x,yeB,xC\y  — r. 

In  particular,  any  uncountable  set  consisting  of  finite  sets  has  an  uncountable  subset 
that  forms  a A-system.  H 

It  is  well  known  that  chain  conditions  establish  upper  bounds  on  powers  of 
cardinals: 

10.5  Proposition.  Suppose  that  P is  a p.o.  with  the  k-c.c.,  and  for  a given  X,  set 
0 = (|P|<K)\  Then 

|[p  r2A  < 01  ■ H 

To  complete  the  review,  what  is  assumed  of  iterated  forcing , first  elaborated 
by  Solovay,  is  summarized  starting  with  product  forcing : Suppose  that  P and  Q 
are  p.o.’s  and  consider  the  product  p.o.  P x Q ordered  by 

iPo,qo)  < ( P\,q\ ) iff  Po  <P  Pi  A qo  <Q  qi  ■ 

Then  forcing  with  P x Q is  characterizable  as  an  iteration  (see  Kunen  [80: 252ff] 
or  Jech  [03: 229ff]): 

10.6  Proposition  (Solovay  [70]).  G is  P x Q-generic  iff  G = Go  x G\,  where 
Go  is  P-generic,  G i is  Q-generic  over  V[Go],  and  so  V[G ] = V[Go][Gi].  H 
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More  generally,  products  of  many  p.o.’s  can  be  taken.  Suppose  that  Pa  is  a 
p.o.  for  every  a < S.  For  p e nff<«5  p<*,  define 

supt {p)  — [a  < 8 | p( a)  11  /»„ } , 

the  support  of  p.  Then  the  k -product  of  (Pa  \ a < 8)  is 

{ P e I |supt(p)|  < k} 

ordered  by 

P < q iff  Va  < 8(p(a)  <Pa  q(a))  . 

For  example,  what  is  meant  by  “adding  X many  Cohen  reals”  is  forcing  with  the 
Ko-product  of  X copies  of  the  p.o.  for  adding  a Cohen  real.  The  Easton  product 
of  (Pa  | a < 8)  is 

[p  e na<spa  I VMA-  regular  ->  |supt(p)  n X\  < A.)} 

ordered  coordinate-wise  as  before.  This  product  was  devised  by  Easton  for  the 
first  forcing  argument  using  a product  of  many  p.o.’s,  indeed  a proper  class  of 
them: 


10.7  Theorem  (Easton  [64, 70]).  Suppose  that  GCH  holds  and  F is  a class  function 
from  the  class  of  regular  cardinals  to  the  class  of  cardinals  such  that  for  regular 
k < X:  F(k)  < F(X)  and  k < cf (F(k)).  Then  there  is  a generic  extension 
preserving  cardinals  and  cofinalities  in  which  2K  — F{k)  for  every  regular  k. 

The  forcing  was  with  the  Easton  product  of  (Pa  a e On)  with  Pa  the  usual 
notion  for  adjoining  F(a)  subsets  of  a if  a is  regular  and  trivial  otherwise.  The 
following  lemma  played  a crucial  role  in  the  product  analysis. 

10.8  Lemma.  Suppose  that  P and  Q are  p.o.  ’s  and  X is  such  that  P has  the 
X+-c.c.  and  Q is  X+ -closed.  If  G is  P x Q-generic  over  V with  G = Go  x G\  as 
in  10.6,  then  any  f:  X — »■  V in  V[G]  is  in  V[G o].  In  particular,  V(X)  n V[G ] = 

V(X)nV[G0].  H 

Of  course,  how  to  force  with  a proper  class  of  conditions  had  to  be  worked 
out  as  well.  For  details  on  this  and  product  forcing  in  general,  see  Jech  [03:  §15]. 

10.7  completely  solved  the  problem  of  determining  the  possible  values  of 
F(k)  = 2"  for  regular  k;  the  only  restrictions  are  that  F be  non-decreasing  and 
consistent  with  the  familiar  Konig  result:  k < cf(F(K)).  The  Singular  Cardi- 
nals Problem  is  to  determine  the  possible  values  of  2K  for  singular  k,  a much 
more  difficult  problem  which  is  far  from  being  solved.  It  is  remarkable  that  large 
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cardinals  are  closely  intertwined  with  this  basic  problem,  and  all  the  more  remark- 
able that  their  intervention  turns  out  to  be  a necessary  one  (see  volume  II). 

Suppose  that  one  wants  to  force  with  a p.o.  P to  get  U [Go]  and  then  with  a 
p.o.  QGo,  not  necessarily  in  V,  to  get  V[Go][Gi].  Then  one  might  as  well  assume 
||-p  Q is  a p.o.,  and  consider 

P *Q  = {{p,  q)  I p e P a lb p q e Q} 


ordered  by 


(Po,qo)  < {puq  1)  iff  Po  <p  Pi  A po  |bp  qo  <q  qi  . 

As  it  stands,  P * Q may  be  a proper  class  because  of  the  possible  q’s,  and  < is 
not  antisymmetric,  but  according  to  previous  remarks 

(po,qo)  and  ( p\,q\ ) are  identified  whenever  ( po,qo ) < (puqi)  < ( Po,qo ) , 

i.e.  po  — pi  A po  |bp  qo  — q\-  In  reasonable  formalizations  of  names,  there  are 
“set”  many  equivalence  classes  under  this  identification,  and  natural  representatives 
can  be  chosen  from  each  to  comprise  what  should  be  meant  by  P * Q (Kunen 
[80:269]).  Henceforth, 

it  is  assumed  that  P * Q is  a set 

under  such  a contraction,  but  (p,  q)  will  be  used  for  its  equivalence  class  to  avoid 
awkward  notation.  Under  a natural  identification,  every  /’-name  is  also  considered 
to  be  a P * Q- name.  Moreover,  after  justifying  p.o.  isomorphisms  like  R = P*Q, 
such  p.o.’s  are  often  identified  for  notational  convenience,  even  though  they  may 
not  be  extensionally  equal.  Forcing  with  P * Q is  characterizable  as  forcing  first 
with  P and  then  with  Q (see  Kunen  [80: 268ff]  or  Jech  [03: 267ff]): 

10.9  Proposition  (Solovay-Tennenbaum  [71]). 

(a)  Suppose  that  Go  is  P-generic  and  G\  is  QG° -generic  over  V[Go].  Set 

Go  * Gi  = {(p,  q)  e P * Q \ p e Go  a cjGo  e Gi)  . 

Then  Go  * G i is  P * Q-generic  and  V[Go  * Gi]  = V[Go][Gi]. 

(b)  Conversely,  suppose  that  G is  P * Q-generic,  and  set 

G0  = {p  e P | 3 q({p,  q)  e G)}  , and 
Gi  = [qGo  | 3p((p,q)  e G)}  . 

Then  Go  is  P-generic,  G \ is  QG° -generic  over  V[Go],  and  G = Go  * G\.  H 

10.6  is  essentially  the  special  case  of  this  result  where  Q is  Q for  some 
Q e V,  when  P x Q is  isomorphic  to  a dense  subset  of  P * Q. 

The  following  structure  theorem  applies  10.9(b)  to  show  that  every  ZFC 
model  intermediate  between  the  ground  model  and  a generic  extension  is  itself  a 
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generic  extension.  This  result  about  the  generality  of  forcing  has  a natural  proof  in 
terms  of  Boolean-valued  models  (see  Jech  [03: 247ff] ) but  may  also  be  established 
with  partial  orders. 

10.10  Proposition.  Suppose  that  R is  a p.o.,  G is  R-generic  over  V,  and  N is  ci 

transitive  e-model  of  ZFC  such  that  V C.  N c.  V[G ].  Then  there  is  a p.o.  P and 
a P-name  Q such  that  ||-/>  Q is  a p.o.  with  P * Q isomorphic  to  R so  that,  thus 
identifying  R with  P * Q,  if  Go  is  as  in  10.9(b),  V[G o]  — N.  H 

The  initial  observation  here  is  that  transitive  e-models  of  ZFC  are  determined 
by  their  sets  of  ordinals  (see  Jech  [03: 196ff])  so  that  one  can  deduce  that  N — V[x] 
for  some  x c On.  Serge  Grigorieff  [75]  investigated  the  situation  for  ZF  models 
N. 

10.9  was  first  used  in  an  argument  that  essentially  established  the  consistency 
of  Martin’s  Axiom  together  with  ->CFI.  This  well-known  axiom  is  briefly  reviewed: 
For  P a p.o.,  V a collection  of  dense  subsets  of  P,  and  Gcf, 

G is  V-generic  iff  (i)  Vp,  q e G3r  e G(r  < p A r < q)  , 

(ii)  Vp  e GVq  e P(p  < q ->  q e G)  , and 

(iii)  WD  e V(G  O D ± 0)  . 

For  any  k,  Martin ’s  Axiom  for  k (MA(k  j)  is: 

If  P is  a p.o.  satisfying  the  wi-c.c.  and  V is  a collection  of  at 
most  k dense  sets,  then  there  is  a G C P which  is  27-generic. 

Simply  Martin ’s  Axiom  (MA)  is: 

Vk  < 2k°  (MA(/c))  . 

The  clear  motivation  here  is  to  justify  internal  generic  objects  when  genericity  is 
restricted  to  small  collections  V e V of  dense  sets.  It  is  well-known  that  MA(«) 
is  a theorem  of  ZFC  and  that  MA(2S°)  and  MA  with  the  &>i-c.c.  restriction  deleted 
are  both  inconsistent. 

10.11  Theorem  (Solovay-Tennenbaum  [71]).  Suppose  that  k is  a cardinal  sat- 

isfying k<k  — k > co.  Then  there  is  a a>\-c.c.  p.o.  P of  cardinality  k such  that 
|[p  MA  A 2k°  =k.  H 

In  particular,  starting  with  a model  satisfying  GCH,  MA  can  be  arranged 
with  2k°  any  prescribed  uncountable  regular  cardinal.  For  details  on  MA  see  Jech 
[03:  §16],  or  Kunen  [80:II§2]  where  it  is  axiomatically  presented  before  forcing 
per  se  is  discussed.  Fremlin  [84]  is  a portmanteau  book  on  the  consequences  of 
MA. 


10.  Development  of  Forcing 


125 


Mild  Extensions 

To  test  the  waters,  some  straightforward  observations  about  forcing  and  large 
cardinals  are  discussed  which,  while  not  noticed  at  the  outset,  have  a direct  bearing 
on  Godel’s  earlier  speculations  on  the  Continuum  Problem  (§3).  Forcing  provided 
the  means  to  show  large  cardinals  do  not  decide  the  power  of  the  continuum. 
Cohen  himself  had  noted  ([64:  110])  that  his  argument  for  the  independence  of 
CH  preserves  inaccessible  and  (“It  seems  probable”)  Mahlo  cardinals,  and  Levy 
and  Solovay  showed  that  measurable  cardinals  are  also  preserved.  In  fact,  their 
argument  is  a generic  one  for  showing  that,  by  and  large,  large  cardinals  k are 
preserved  in  any  mild  extension,  i.e.  via  a forcing  with  a p.o.  of  cardinality  less 
than  k.  The  following  useful  results  pursue  this  theme  through  several  cases: 

10.12  Proposition.  Suppose  that  k is  inaccessible  and  P is  a p.o.  such  that 
| P\  < k.  Then 

||-/>  k is  inaccessible  . 

Proof.  This  follows  from  the  |P|+-c.c.:  ||-  k is  regular,  and  10.5  implies  that  ||-  k 
is  a strong  limit.  H 

10.13  Proposition.  Suppose  that  k is  Mahlo  and  P is  a p.o.  such  that  P < k. 
Then 

||-p  k is  Mahlo  . 

The  proof  relies  on  a generally  useful  observation: 

10.14  Lemma.  Suppose  that  k > co  is  regular,  P is  a p.o.  with  the  k-c.c.,  p e P, 
and  p \\-  C is  closed  unbounded  in  k.  Then  there  is  a Co  closed  unbounded  in  k 
such  that  p ||-  Co  C C. 

Proof.  For  a < k,  set 

Xa  — {§  < k | 3q  < p(q  ||-  f is  the  least  member  of  C — (a  + 1))}  . 

By  the  /c-c.c.,  |Z„|  < k,  so  set  /(a)  = supIX,*)  < k.  Finally,  let 
Co  = {/3  < k | V«  < P(f(a)  < yS)}  , 
a closed  unbounded  set. 

To  show  that  p ||-  Co  c C,  assume  to  the  contrary  that  there  is  a q < p and 
a P e Co  such  that  q \[  fi  £ C.  Then  for  some  r < q and  a < /J,  r \\-  a = 
sup(C  (T  /8).  For  some  s < r and  % e Xa,  s ||-  f is  the  least  member  of  C — (a+ 1). 
But  § < p as  b e Co,  contradicting  the  choice  of  a.  H 

Proof  of  10.13.  Suppose  that  p ||-  C is  closed  unbounded  in  k.  By  10.12,  if  1 is 
inaccessible  and  | P I < K then  ||-  X is  inaccessible.  Since  k is  Mahlo,  by  10.14 
there  are  such  X < k satisfying  p \\-  X e C.  ~\ 
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10.15  Proposition  (Levy  [64],  Solovay  [65a],  Levy-Solovay  [67]).  Suppose  that 
k is  measurable  and  P is  a p.o.  such  that  \ P\  < at.  Then 

|[p  k is  measurable  . 

Proof.  Let  U be  a a: -complete  ultrafilter  over  k,  and 

lh  U = {Y  c K | 3X  e U(X  c y)}  , 

the  filter  generated  by  U.  It  will  in  fact  be  shown  that  ||-  U is  a Ar-complete 
ultrafilter  over  k.  The  following  claim  can  serve  a larger  purpose: 

(*)  If  p ||-  r:  k — >■  V,  then  there  are  q < p,  Z e U , 
and  f:Z—>V  such  that  q \\-  i\Z  — f . 

To  show  this,  for  each  r < p let  Xr  = {£  e k | 3 x(r  ||-  r(|)  = x)}.  Then 
|Jr<  Xr  = k and  | P\  < k,  so  by  k -completeness  there  is  a q < p such  that  Xq  e 
U.  But  then,  for  /:  Xq  V defined  by  f(f)  = that  x such  that  q ||-  r(f)  = x 
(unique  as  r is  forced  to  be  a function),  it  follows  that  q ||-  i\Xq  — f. 

To  complete  the  proof,  note  first  that  ||-  U is  non-principal,  so  it  remains  to 
verify  that  ||-  U is  ultra  and  a: -complete.  For  this  it  suffices  by  simple  arguments 
(cf.  2.7)  to  show  that  if  y < k and  ||-  r : at  ->  y,  then  there  is  an  a such  that 
II- i-fla})  e U: 

By  the  claim,  there  are  Z e U and  /:  Z ->  y such  that  ||-  r|Z  = /.  But 
U is  ^-complete,  so  there  is  an  a < y such  that  A = f_1([a})  e U.  Hence, 
j[lnzcr-‘({ff})e(/.  H 

A further  consequence  of  the  claim  (*)  is  that  if  U is  normal,  then  ||-  U is 
normal. 

10.16  Exercise  (Levy-Solovay  [67]).  10.15  holds  with  “ measurable  ” replaced  by 
"Ramsey”  or  by  “ weakly  compact”. 

Hint.  For  Ramsey,  if  p |[  r:  [at]<<u  -*  2,  define  g : [at]<&'  -»  V(P  x 2)  by 
gfy)  = {(^r,  i)  | q < p a q \[  i:(s)  — ;}.  Show  that  if  H is  homogeneous  for  g, 
then  p ||-  7/  is  homogeneous  for  r.  H 

Similar  arguments  can  be  devised  for  a variety  of  large  cardinals.  The  problem 
of  preserving  large  cardinals  when  forcing  with  p.o.’s  of  large  cardinality  was  to 
lead  to  important  developments  both  in  the  theory  of  forcing  as  well  as  large 
cardinals. 

The  Levy  Collapse 

The  first  announcement  after  Cohen’s  breakthrough  of  results  using  forcing  was 
made  four  months  later,  and  notably  one  result  involved  an  inaccessible  cardinal. 
Motivated  by  questions  of  definability  Levy  [63,70]  formulated  a gentle  collapse 
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of  an  inaccessible  cardinal  that  has  become  basic  to  the  theory,  and  experience  has 
shown  that  it  often  suffices  for  relating  large  cardinals  with  combinatorial  proper- 
ties of  &> i and  a>2  in  equiconsistency  results.  For  clarity  and  later  convenience,  a 
general  setting  is  first  pursued: 

For  S c On  and  A regular, 

Col(A,  S ) = {p  | p is  a function  A \p\  < A A dom(/;)  c S x A 
A V(a,  §}  e dom(p)(p(a,  f ) = 0 V p(a,  £)  e a)} 

ordered  by  p < q iff  p 3 q.  Thus,  Col(A,  .S')  is  a p.o.  for  generically  adjoining 
surjections:  A ->  a for  every  a e S.  Col(o>,  {cw})  is  essentially  Cohen’s  original 
p.o.  for  adjoining  a subset  of  a>,  a Cohen  real;  Col(A,  {A})  its  generalization  for 
adjoining  a Cohen  subset  of  A;  and  Col(A,  {at})  for  k > A the  standard  p.o.  for 
collapsing  \k\  to  A.  The  following  lemma  lists  some  elementary  properties: 

10.17  Lemma. 

(a)  Col(A,  S ) is  X-closed. 

(b)  Suppose  that  S = XUf  is  a disjoint  union,  and  set  Pq  — Col(A,  X)  and 
P\  — Col(A,  Y).  Then  G is  Col(A,  S)-generic  iff  G — {pGq  | p e GoAq  e Gi}, 
where  Go  is  Po-generic,  Col(A,  T)y[G°]  = Pu  and  G\  is  P\-generic  over  V[Go]. 

(c)  If  k is  regular,  k > A,  and  either  k is  inaccessible  or  A = w,  then 
Col(A,  k ) has  the  k-c.c. 

(d)  If  Col(A,  k)  has  the  k-c.c.,  then  forcing  with  it  preserves  cardinals  < A 
and  > k. 

(e)  Suppose  that  k is  regular,  Col(A,  k)  has  the  k-c.c.,  and  G is  Col(A,  k)- 
generic.  Then  for  any  x e V[G ] with  x : y ->  On  where  y < k,  there  is  a S < k 
such  that  x e V[G  P\  Col(A,  <5)]. 

Proof,  (a)  This  is  obvious. 

(b)  This  follows  from  10.6,  since  the  association  of  p e Col(A,  S)  with 
{p\(X  x X),  p\(Y  x A)}  is  an  isomorphism  between  Col(A,  S)  and  the  product 
p.o.  Po  x Pi;  that  Col(A,  Y)v^  = Pi  follows  from  (a)  since  forcing  with  P0 
adds  no  new  ordinal  sequences  of  length  less  than  A. 

(c)  This  follows  by  a typical  application  of  the  /d-system  lemma  10.4:  Given 
{pp  | f < k}  c Co1(A,at),  the  lemma  together  with  the  hypothesis  imply  that 
there  is  an  X e [at]'1  and  an  r c k x A such  that  fi  f y both  in  X implies  that 
dom(/7^)fldom(py)  = r.  By  definition  of  Col(A,  k)  and  |r|  < k,  there  is  a further 
Y e [X]K  such  that  f f y both  in  Y implies  that  pp  \ r = pY  \ r.  But  for  such  p 
and  y,  pp  and  pY  are  compatible. 

(d)  This  uses  (a). 

(e)  Let  x be  such  that  xG  = x , and  for  each  a < y let  Aa  c Col(A,  k)  be  a 
maximal  antichain  such  that  for  each  p e Aa  there  is  a f such  that  p ||-  i(a)  = 
|A„|  < k for  each  a < y by  the  k-c.c.  so  that  lUa<y^“l  < K > ancl  hence 
by  the  regularity  of  k there  is  a 8 < k such  that  p e {Ja<y  Aa  implies  that 
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dom(/;>)  C <5x1.  Finally,  x is  definable  from  G fl  Col(k,  5)  by:  x(a)  = £ iff 
p ||-  i(a)  = f for  the  unique  p e G fl  Aa.  H 

For  k > k, 

Col(A.,  k ) is  a Levy  collapse  , 
and  when  it  has  the  k-c.c., 

Col(k,  k ) collapses  k to  k+ 

because  of  10.17(d).  This  is  part  of  the  parlance;  forcing  with  Col(A.,/r)  does 
not  collapse  c as  a cardinal,  but  gently  renders  it  the  successor  cardinal  of  X 
by  collapsing  all  cardinals  strictly  between  k and  k.  Levy’s  original  p.o.  was 
Col(«,  k)  with  k inaccessible,  and  usually  10.17(c)  is  invoked  with  inaccessible 

K. 

Digressing  briefly,  a proof  is  next  provided  of  8.16,  the  equiconsistency  of 
Rowbottom  and  Jonsson  cardinals.  One  direction  is  immediate,  as  Rowbottom 
cardinals  are  Jonsson,  so  the  next  proposition  completes  the  proof.  It  is  established 
by  a mild  extension  argument  with  two  twists:  10.17(c)  is  invoked  for  k — a>,  and 
the  large  cardinal  property  to  be  preserved  does  not  entail  (strong)  inaccessibility. 

10.18  Proposition  (Kleinberg  [72, 79]).  If  k is  the  least  Jonsson  cardinal,  then 
there  is  a p.o.  P such  that 


|[p  k is  Rowbottom  . 

Proof.  By  8.15  there  is  a regular  v < k such  that 

(*)  k is  v-Rowbottom  and  ac— for  any  a < v . 

By  10.17(c),  P — Col(o),  v)  has  the  v-c.c.,  and  so  renders  v — co i in  any  generic 
extension.  Hence,  it  suffices  to  show  that 

|[p  k is  v-Rowbottom  . 

To  this  end,  suppose  that  p e P,  y < k,  and  p ||-  /:  [k]<<w  — >■  y.  For  each 
s e [k]<w  let  A.s  cfbea  maximal  antichain  below  p such  that  for  each  r e As 
there  is  an  ay  < y such  that  r ||-  f(s)  = ay.  |A.S|  < v by  the  v-c.c.,  so  by  (*) 
there  is  a gs:  [k]<w  -»  As  such  that  for  any  X e [v]*,  gs“X<0)  = As.  Now  define 
h\  [at]<®  —>  y by  setting  h(%\,  = 0 unless  n — 2'V  for  some  i,  j > 0,  in 

which  case: 


h{H  l,  ...,£„)=  the  o'  such  that  g{fl,...,fe}(^+i, . . . , &+j)  ||-  f(£\,  = a . 

By  (*)  there  is  an  H e [ac]k  such  that  setting  E = h“[H]<0),  \E\  < v.  To 
complete  the  proof,  it  suffices  to  show  that  p ||-  f“[H]<01  c E. 

To  do  this,  suppose  that  q < p,  and  for  some  s e [H]<a>  and  a < y, 
q ||-  f{s)  — a.  By  definition  of  As,  there  is  an  r e As  compatible  with  q,  and  since 


10.  Development  of  Forcing 


129 


p ||-  / is  a function,  a — ar.  By  definition  of  gs,  there  is  a t e [H— (max(s)+l)]<<u 

such  that  gs(t)  = r.  Finally,  if  ^ = [fi , f,)  and  t = {f,+i f(+/),  set 

n — 2'V  and  choose  Hi+j+\  < ■■  ■ < Hn  all  from  H — (max(?)  +1).  Then  it  is 
simple  to  check  that  /i(fi, . . . , f„)  = a e E,  and  so  the  proof  is  complete.  H 

Returning  to  the  main  development,  Levy’s  initial  result  with  his  collapse  is 
discussed  in  the  context  of  Solovay’s  result  in  §11,  and  both  use  a fundamental 
property  of  Col(k,  S).  For  a p.o.  P, 

P is  weakly  homogeneous  iff  for  any  p,q  e P there  is  an  auto- 
morphism e of  P such  that  e(p)  ||  q . 

This  property  arose  in  the  independence  proofs  for  AC.  The  “weakly”  acknowl- 
edges that  a Boolean  algebra  B is  called  homogeneous  if  for  a,  b e B with  neither 
a nor  b equal  to  either  the  zero  or  the  one  of  B,  there  is  an  automorphism  e of  B 
such  that  e(a)  — b.  The  first  part  of  the  next  proposition  is  the  raison  d’etre  for 
weak  homogeneity. 

10.19  Proposition. 

(a)  If  a p.o.  P is  weakly  homogeneous,  then  for  any  formula  q>{v\ , . . . , v„) 
in  the  forcing  language  and  x\,...,xn  e V,  either  ||-p  <p(x \,...,xn)  or  else 
Ibp  -><p(x i,  ...,x„). 

(b)  Col(k,  S ) is  weakly  homogeneous. 

Proof,  (a)  Assume  to  the  contrary  that  p ||-  <p(x i, . . . , x„)  and  q ||-  —><p(x\,  . . . , xn ) 
for  some  p,q  e P,  and  let  e be  an  automorphism  of  P such  that  e(p)  ||  q. 
e induces  a bijection:  P-names  -*  P-names  that  fixes  x for  every  x e V and 
preserves  the  forcing  relation  (see  Kunen  [80: 222]).  Thus,  e(p)  ||-  <p(x\, . . . , xn ), 
but  this  contradicts  e(p)  ||  q. 

(b)  Given  p,  q e Col(k,  S ) it  is  easy  to  find  a bijection  /:  A.  — > X such  that 
for  (a,  %)  e dom(p)  and  (/S,  0)  e dom(g),  /(f)  f f f induces  an  automorphism 
e of  Col(k,  S ) that  to  r e Col(k,  S ) associates  e(r)  specified  by 

(a,  /(f))  e dom(e(r ))  iff  {a,  f>  e dom(r)  A e(r)(a,  /(f))  = r(a,  f)  . 

Since  dom(e(p))  fl  dom(^)  = 0,  e(p)  ||  q. 

Finally,  Solovay  established  a result  specific  to  Col(tt>,  k)  for  his  Lebesgue 
measurability  result  (§11).  The  natural  proof  uses  what  amounts  to  a characteri- 
zation of  Col(cu,  {a})  of  intrinsic  interest: 

10.20  Proposition.  Suppose  that  P is  a separative  p.o.  such  that  |P|  < |a|  and 

|[p  3 /(/:  a)  — > a is  surjective  A f £ V)  . 

Then  there  is  an  injective,  dense  embedding  of  a dense  subset  of  Col(ft>,  {a})  into 

P. 
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Remarks.  The  hypothesis  is  dichotomous  in  that  /:&>->  a being  surjective  is 
only  material  when  a is  uncountable,  and  f f V,  when  a is  countable.  For 
a — co,  the  result  densely  associates  any  non-trivial  countable  p.o.  with  the  Cohen 
p.o.  Col(<u,  {<w}).  Finally,  by  this  result  and  10.1,  the  generic  extensions  by  such 
P coincide  with  the  generic  extensions  by  Col(&>,  {a});  the  separativity  of  P is 
extraneous  in  this  regard  since  the  separative  quotient  can  first  be  taken  and  10.3 
invoked. 

Proof.  Set  v — |a|.  Note  first  that  the  hypotheses  imply  that  for  any  p e P there 
is  a maximal  antichain  below  p of  cardinality  v (so  that  in  particular  | P\  = v ): 
If  v — co,  this  follows  from  \[  f f V since  any  condition  must  have  incompatible 
extensions.  If  v > co,  this  follows  from  the  collapse  of  v to  co  and  the  consequent 
failure  of  the  v-c.c. 

The  dense  subset  of  Col(a>,  {a})  in  question  is 

Da  — {p  | 3 n e co(p\  {a}  x n ->  a )}  , 

i.e.  those  members  of  Col(<u,  {a})  which  are  not  only  finite  but  have  domains  of 
form  {a}  x n for  some  n e co.  By  the  maximal  principle,  there  is  a term  g such 
that  ||-  g:  co  — > G is  surjective  (where  G is  the  canonical  name  for  the  generic 
object).  Proceed  now  by  recursion  on  \p\  for  p e Da  to  define  an  injective,  dense 
embedding  e:  Da  — > P : 

Set  e(0)  = Up.  Having  defined  e(p)  for  a p e Da  with  dom(p)  = {a}  x n, 
let  (ajj  | £ < a)  enumerate  a maximal  antichain  below  e(p)  of  cardinality  v.  By 
refining  if  necessary  it  can  be  assumed  that  each  aj!  ||-  g(n)  — r for  some  r e P. 
Set  e(p  U {((a,  n ),  f )})  = aj?  for  each  § < a. 

Clearly,  this  definition  provides  an  isomorphism  e:  Da  — >■  <?“/)„.  To  show 
that  e“Da  is  dense  in  P,  let  r e P be  arbitrary.  Since  r ||-  r e G by  separativity, 
there  is  an  s < r and  an  new  such  that  s ||-  g(n)  — r.  By  construction  there  is 
a p e Da  with  \p\  = n + 1 such  that  e(p)  ||  s,  and  since  e(p)  had  also  decided 
g(n),  e(p)  ||-  g(n)  — r.  But  then,  e(p)  || - r e G,  and  so  by  separativity  and  10.2, 
e(p)  < r.  H 

10.21  Proposition  (Solovay  [70]).  Suppose  that  k > co  is  regular  and  G is 
Col(<w,  K)-generic.  Then  for  any  x e V[G]  with  x : co  ->  On,  there  is  an  PI  which 
is  Col(&),  K)-generic  over  V[x]  such  that  V[G ] = V[^][H]. 

Proof.  By  10.17(e),  there  is  a 8 < k such  that  x e V[G  IT  Col(tt),  S)].  Set 

Go  = G 0 Col(tt),  5)  , 

G\  = G n Col(ft>,  {^})  , and 
G2  = G 0 Col(&>,  k-(S+  1))  . 

By  10.17(b),  Go  is  Col(&>,  5)-generic,  and  so  by  10.3  and  10.10  there  is  a separative 
p.o.  P e V[x]  and  a P-generic  lit,  over  V[x]  such  that  V[x~\[Pl o]  = V[Go].  Argue 
in  V[x]: 
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Set  Q — P xCol(«,  {5}).  By  10.6,  V[.r][//o][Gi]  is  a generic  extension  over 
V[x ] using  Q.  Clearly,  \Q\  < |Col(&>,(5  + 1)|  = |<5|,  and  because  of  the  second 
component  of  Q,  \\-q  3 /(/:  a>  — > S is  surjective  A f g V).  Hence,  by  10.20 
there  is  an  H\  Col(&>,  {<5})-generic  over  V[x ] such  that  V[x][H\]  — V[x][Ho][Gi]. 
Also  by  10.20  applied  to  the  p.o.  Col(a»,  <5+l),  there  is  an  Hi  Col(&>,  <5+l)-generic 
over  V[x ] such  that  V[x][i/2]  = V[x][H\], 

Finally,  by  several  applications  of  10.17(b), 

V[G]  = V[G0][Gi][G2]  = V[x][if0][Gi][G2]  = V[x][H2][G2\ 
and  H2  U G2  is  Col(&>,  K)-generic  over  V[x],  H 

That  any  countable  sequence  of  ordinals  can  be  “absorbed”  into  the  ground 
model  in  this  way  plays  an  important  role  in  the  next  section. 
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11.  Lebesgue  Measurability 

This  section  is  devoted  to  the  following  result: 

11.1  Theorem  (Solovay  [65b,  70]  ).  Suppose  that  k is  an  inaccessible  cardinal  and 
G is  Col(<y,  K)~generic.  Then  V[G ] has  an  inner  model  satisfying: 

(a)  Every  set  of  reals  is  Lebesgue  measurable. 

(b)  Every  set  of  reals  has  the  Baire  property. 

(c)  Every  set  of  reals  has  the  perfect  set  property. 

(d)  The  Principle  of  Dependent  Choices  (DC). 

Solovay  established  this  result  in  the  Spring  of  1964,  just  a year  after  Cohen’s 
creation  of  forcing.  Cohen  had  in  fact  mentioned  the  possibility  of  (a)  in  a lec- 
ture in  July  1963  attended  by  Solovay.  The  result  was  remarkable  for  its  early 
sophistication  and  revealed  what  standard  of  argument  was  possible  with  forcing. 

Like  Cohen’s  results,  11.1  resolved  classical  problems  dating  back  to  the  turn 
of  the  century  and  before.  Lebesgue  measurability  and  the  Baire  property  were 
formulated  in  §0.  As  discussed  in  §2  Lebesgue  had  intended  his  measure  as  a 
solution  to  his  Measure  Problem,  but  Vitali  in  an  early  explicit  use  of  the  Axiom  of 
Choice  constructed  a non-Lebesgue  measurable  set  of  reals  from  a well-ordering 
of  the  reals.  In  fact,  it  can  be  shown  that  Vitali’s  set  does  not  possess  the  Baire 
property  either.  (Another  counterexample  is  described  after  11.4.)  However,  the 
substance  and  import  of  11.1  is  ensured  by  its  (d),  the  Principle  of  Dependent 
Choices'. 

(DC)  VXVR(X /0a«cXxXaVj6  A3  v e X((x,  y)  e R 
->  3 fe  wXWn  e «((/(«),  fin  + 1)>  e R))  . 

Formulated  by  Bemays  [42:  86]  (with  a precedent  in  a short-lived  journal,  Oswald 
Teichmiiller  [39:  568])  this  choice  principle  readily  implies,  and  in  fact  is  equiva- 
lent to  having,  the  useful  characterization  of  well-foundedness  in  terms  of  the  lack 
of  infinite  descending  chains.  Moreover,  it  readily  implies  the  Countable  Axiom 
of  Choice, 

(AC,a)  Every  countable  set  consisting  of 

non-empty  sets  has  a choice  function  , 

and  in  turn  the  regularity  of  co\.  ( Jensen  [66]  showed  that  AC(U  does  not  imply 
DC.)  Being  handily  sufficient  for  the  development  of  measure  theory,  DC  bolsters 
Solovay’s  model  as  a bona  fide  one  for  mathematical  analysis.  Lebesgue’s  solution 
was  thus  vindicated  by  11.1  to  the  extent  that  it  showed  that  some  substantial  use 
of  AC  is  necessary  to  produce  a non-measurable  set. 

Lebesgue  measurability  and  the  Baire  property  are  often  linked  together,  and 
the  proofs  of  11.1(a)  and  (b)  have  parallel  arguments.  On  the  other  hand,  the 
following  exercise  highlights  how  being  null  and  being  meager  are  quite  different 
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indications  of  a set’s  sparsity.  In  it  and  throughout  this  section,  the  terminology 
developed  in  §0  for  measure  and  category  is  put  into  service;  in  particular,  by 
“real”  is  usually  meant  member  of  ‘°co;  mL  denotes  Lebesgue  measure;  and 

{s i | i e a>)  is  a fixed  enumeration  of  <m&> . 

11.2  Exercise  (ZF).  There  is  a null  set  N of  reals  such  that  ® u>  — N is  meager. 

Hint.  First,  for  j < a>  find  a Dj  C such  that  wa>  — Dj  is  nowhere  dense  and 
mL(Dj)  < jfij- . For  example,  for  any  .?  e <roco  let  s *n  be  .s  concatenated  with  n 
zero’s,  and  set  Dj  — [Jj€eo  0(si*  fj(i ))  for  some  fj  e wco  sufficiently  fast-growing 
so  that  mL(Dj ) < Then  set  N = P|J<a)  Dj. 

The  perfect  set  property  dates  back  to  Cantor’s  original  efforts  to  establish 
his  Continuum  Hypothesis.  For  x e ‘ °co  and  A C x is  a limit  point  of  A iff 
for  any  O(s)  with  x e O(s),  (A  fl  Ofs))  — {x}  0.  x is  an  isolated  point  of  A iff 

x e A and  is  not  a limit  point  of  A,  i.e.  there  is  an  O(s)  such  that  A fl  O(s)  — {x}. 
For  A C 01  (a, 

A is  perfect  iff  it  is  nonempty,  closed,  and  has  no  isolated  points  . 

Cantor  formulated  this  concept  in  his  topological  investigations  of  R.  Its  relevance 
to  the  Continuum  Hypothesis  can  be  seen  from  the  following  observations: 

11.3  Proposition  (ZF). 

(a)  (Cantor  [83:  575][84: 471],  Bendixson  [83])  For  any  uncountable  closed 
set  C of  reals,  there  is  a perfect  set  P c C such  that  C — P is  at  most  countable. 

(b)  (Cantor  [84a])  Any  perfect  set  of  reals  has  cardinality  2K°. 

Proof,  (a)  For  any  A C “a,  let  A'  = the  limit  points  of  A (Cantor’s  [72]  concept 
of  the  derived  set  of  A).  It  is  simple  to  check  that  A'  is  always  a closed  set,  and  for 
A closed,  that  A'cA.  Starting  with  an  uncountable  closed  set  C of  reals,  define 
(C„  | of  < u>\)  by:  C0  = C,  Ca+ 1 = C'a,  and  for  limit  S > 0,  C5  — Plo-c?  Ca- 
Then  X = [J a<a)fCa  — Ca+ 1)  is  at  most  countable:  If  x e X,  noting  that  x is  an 
isolated  point  of  exactly  one  C„  let  /(x)  be  the  least  i such  that  0(s;)nCa,  = [x]. 
It  is  simple  to  see  that  /:  X ->  a>  is  injective. 

Since  X is  at  most  countable,  there  must  be  a y < u>\  such  that  CY+ i = 
Cy  0.  Then  P — Cy  is  perfect,  and  C — P = X. 

(b)  Suppose  that  P c is  perfect.  Then  there  are  p,  e P for  every 
t e <w2  and  a strictly  increasing  sequence  (n,-  \ i e a>)  of  integers  such  that  for 
any  t,  u € <u>2, 

(i)  if  t c u,  then  p,\nw  = pu\n\,\  , and 

(ii)  if  \t\  = \u\  yet  t ± u,  then  p,\nw  ± pu\n\t\  ■ 

This  can  be  established  by  a straightforward  recursion  on  \t\.  For  each  x e "2, 
let  px  = lim„{pv|n  | n e a>}  in  the  natural  sense  based  on  (i).  Then  px  e P since 
P is  closed,  and  x f y implies  that  px  pv  by  (ii).  H 
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Cantor  thus  verified  the  Continuum  Hypothesis  for  closed  sets:  every  closed 
set  is  either  countable  or  has  cardinality  2K° . Put  another  way,  he  had  reduced 
his  efforts  to  finding  a closed  set  of  reals  of  cardinality  Ki.  Through  Cohen’s 
work  it  is  now  known  that  this  is  impossible  in  ZFC,  but  Cantor  had  initiated 
a programmatic  approach  to  the  Continuum  Problem  based  on  the  topological 
complexity  of  sets  which  was  to  be  vigorously  pursued  in  descriptive  set  theory. 

For  any  A C l0u>, 

A has  the  perfect  set  property  iff 

A is  countable  or  else  has  a perfect  subset . 

Like  Lebesgue  measurability  and  the  Baire  property,  the  perfect  set  property  can 
be  regarded  as  a property  indicative  of  well-behaved  sets,  and  eventually  all  Borel 
sets  and  more  were  shown  to  possess  it.  However,  Bernstein,  who  was  initially 
a student  of  Cantor  at  Halle,  showed  that  a modest  form  of  the  Axiom  of  Choice 
already  confronts  this  approach  to  the  Continuum  Problem.  In  1897  when  just 
19,  Bernstein  had  established,  with  a readily  avoidable  use  of  AC,  what  is  usually 
known  as  the  Schrdder-Bemstein  Theorem:  if  | A\  < |B|  and  |B|  < |A|,  then  there 
is  a bijection  between  A and  B.  The  sense  of  this  derives  from  < for  “cardinals” 
in  the  ZF  setting  denoting  injective  embeddability.  In  particular,  co \ < 2K°  means 
there  is  an  uncountable,  well-orderable  set  of  reals.  Bernstein’s  argument  for  the 
perfect  set  property  can  be  analyzed  as  follows: 

11.4  Proposition  (ZF)(Bemstein  [08]). 

(a)  If  the  reals  are  well-orderable,  then  there  is  a set  A of  reals  of  cardinality 
2^°  such  that  P fl  A f 0 and  P — A f 0 for  any  perfect  set  P of  reals. 

(b)  If  a>\  < 2S°,  then  there  is  a set  of  reals  without  the  perfect  set  property. 

Proof,  (a)  By  a straightforward  counting  argument  using  the  Schrdder-Bemstein 
Theorem  there  are  exactly  2K°  perfect  sets  (the  easier  < 2K°  suffices  for  the  proof). 
Using  a fixed  well-ordering  of  "co  let  ( Pa  \ a < 2K°)  be  an  enumeration  of  the 
perfect  sets  and  define  aa  f ba  for  a < 2K°  as  follows:  If  ap,bp  have  been 
defined  for  p < a,  let  aa  be  the  least  a in  the  well-ordering  such  that 

a e Pa-  {{ap  \ P < a}  U {bp  \ P < a}) 
and  then  let  ba  be  the  least  b in  the  well-ordering  such  that 
b e Pa-  ({ ap  | p < a}  U {bp  \ P < a})  . 

This  is  possible  by  11.3(b).  The  desired  A can  be  taken  to  be  {aa  \ a < 2K° }. 

(b)  If  2k°  f w i,  then  by  the  hypothesis  and  11.3(b)  there  is  a well-orderable 
set  of  reals  without  the  perfect  set  property.  If  2K°  < co\ , then  by  hypothesis  wco 
is  bijective  with  a>\  by  the  Schrdder-Bemstein  Theorem,  and  so  the  conclusion 
follows  from  (a).  H 

A remark  of  Bernstein’s  (p.  330)  is  interpretable  as  asserting  that  the  denial 
of  the  perfect  set  property  in  the  strong  sense  of  (a)  leads  to  a non-Lebesgue 
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measurable  set.  This  is  simple  to  see,  since  if  A were  measurable,  either  rnL(A)  > 
0 or  mL(wco  — A)  > 0,  say  the  first.  By  0.9(a)  there  would  be  a closed  set  C c A 
such  that  mL(C ) > 0,  so  by  11.3(a)  a perfect  set  PCC,  which  is  a contradiction. 

Moreover,  the  set  A cannot  have  the  Baire  property  either:  Suppose  other- 
wise; by  taking  its  complement  if  necessary  it  can  assumed  that  A is  not  meager. 
By  0.12(b)  there  is  a G$  set  X C A such  that  A — X is  meager.  But  X is  uncount- 
able as  A is  not  meager,  yet  any  uncountable  Borel  set  contains  a perfect  subset 
(by  12.2(c)),  which  is  a contradiction. 

The  perfect  set  property  led  to  the  first  instance  of  an  important  phenomenon 
in  set  theory:  the  derivation  of  equiconsistency  results  based  on  the  complementary 
methods  of  forcing  and  inner  models.  A large  cardinal  hypothesis  is  typically 
transformed  into  a proposition  about  sets  of  reals  by  forcing  that  collapses  the 
cardinal  to  a>\  or  enlarges  the  power  of  the  continuum  to  the  cardinal.  Conversely, 
the  proposition  entails  the  same  large  cardinal  hypothesis  in  the  clarity  of  an  inner 
model.  Thus,  in  a subtle  synthesis,  hypotheses  of  length  concerning  the  extent 
of  transfinite  are  correlated  with  hypotheses  of  width  concerning  the  fullness  of 
power  sets.  Solovay’s  11.1  provided  the  forcing  direction  from  an  inaccessible 
cardinal  to  the  proposition  that  every  set  of  reals  has  the  perfect  set  property. 
Notably,  Specker  [57]  had  already  made  the  conceptual  move  to  inner  models; 
through  a sequence  of  implications  he  had  in  effect  established  in  ZF  that  if  every 
set  of  reals  has  the  perfect  set  property  and  co\  is  regular,  then  co\  is  inaccessible  in 
L (cf.  Mycielski  [64:213]).  Thus,  Solovay’s  use  of  an  inaccessible  cardinal  was 
necessary  for  getting  ll.l(c)(d),  and  its  collapse  to  w\  complemented  Specker’ s 
observation. 

Specker’ s conclusion  can  be  readily  strengthened  to  w\  is  inaccessible  from 
reals: 

Yu  e < co\)  . 

This  soon  became  a focal  hypothesis,  and  the  description  is  justified  by  a simple 
self-refinement: 

11.5  Proposition  (ZF).  If  a>i  is  regular  and  Va  e <Ba>(a>f^  < w\),  then 
Yu  e wa)(a>i  is  inaccessible  in  L[a])  . 

Proof.  Assume  to  the  contrary  that  for  some  a e ®a»,  ix>\  is  not  inaccessible 
in  L[a\.  Since  L[a]  satisfies  GCFI  by  the  same  proof  as  for  L,  a>\  must  be  a 
successor  cardinal  in  L[a\,  say  w \ = p+L[a].  Since  p is  countable,  let  b e a>co 
code  a well-ordering  of  a>  of  ordertype  p.  But  if  c e w(o  codes  a and  b,  then 

= «i,  which  is  a contradiction.  H 

Finally,  Levy’s  initial  result  with  his  collapse  Col(®,  k)  can  be  brought  into 
the  scheme  of  things;  he  essentially  observed  the  equiconsistency  of  (a)  and  (c) 
below: 
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11.6  Theorem  (Solovay  [65b,  70],  Specker  [57],  Levy  [63,70]).  The  following 
theories  are  equiconsistent: 

(a)  ZFC  + 3 k(k  is  inaccessible). 

(b)  ZF  + DC  + 1 every  set  of  reals  has  the  perfect  set  property1 . 

(c)  ZF  + w\  is  regular  + w\  2K°. 

(d)  ZF  + a>\  is  regular  + Wa  e < to i). 

Proof.  Con(d)  — >■  Con(a)  follows  from  11.5  and  Con(a)  — > Con(b)  from  11.1 
once  it  is  established,  (b)  — »■  (c)  and  (c)  ->  (d)  are  direct,  the  first  from  1 1 .4(b) 

and  the  fact  that  DC  implies  that  a>\  is  regular,  and  the  second  as  follows:  For 

any  a e "&>,  V(a>)  fl  L[a]  has  a well-ordering  in  L[a ] of  ordertype  so 

— &>i  would  imply  that  to\  < 2K°.  H 

The  question  of  whether  an  inaccessible  cardinal  is  necessary  in  11.1  for 
Lebesgue  measurability  or  the  Baire  property  was  left  unresolved,  and  consid- 
erable effort  was  eventually  expended  into  trying  to  eliminate  the  large  cardinal 
hypothesis.  In  one  of  those  twists  that  makes  set  theory  so  intriguing  Shelah  [84] 
in  1979  put  these  issues  to  rest  by  establishing  that  indeed  an  inaccessible  cardinal 
is  not  necessary  for  the  Baire  property,  but  rather  unexpectedly,  it  is  necessary  for 
Lebesgue  measurability  (see  volume  II). 

After  this  extended  preamble,  we  finally  begin  the  ascent  to  11.1  with  the 
preliminaries  toward  the  Lebesgue  measurability  result. 

Random  Reals 

Solovay  took  the  first  step  beyond  the  p.o.’s  like  Col(k,  S)  of  §10  that  arose 
initially  with  the  creation  of  forcing  and  investigated  forcing  with  the  familiar 
a -algebra  of  Borel  sets.  With  the  §0  terminology,  consider 

B*  = {X  e B | X is  not  null]  , 


ordered  by 


P <q  iff  p^q 


as  a p.o.  for  forcing.  The  following  is  simple: 


11.7  Exercise. 

(a)  For  p , q e B* , p F q iff  p IT  q is  null. 

(b)  B*  has  the  w\-c.c. 

Hint,  (b)  is  like  2.1.  H 


It  is  simple  to  see  that  B*  is  not  separative,  and  that  its  separative  quotient 
B*  /&  can  be  formulated  with  p m q iff  p A q is  null.  Using  this  as  a definition 
~ can  be  extended  to  all  of  Ml,  the  a -algebra  of  Lebesgue  measurable  sets. 
The  quotient  Ml /^,  which  coincides  with  B/^,  is  a Boolean  algebra  under 
the  set-theoretic  operations  modulated  by  « called  the  Lebesgue  measure  algebra. 
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Solovay  worked  in  terms  of  this  algebra,  but  carrying  on  with  B*  avoids  in- 
equivalence classes  and  makes  no  difference  by  10.3. 

Toward  Solovay’s  characterizations  of  £>*-genericity,  codes  for  the  open, 
closed,  and  Gs  sets  are  developed:  For  each  c e 0,co,  set 


Ac 


U{0(s,')  I c(i  + 1)  = 1}  if  c(0)  = 0 , 

“co  - Uf<5(S/)  I c(i  + 1)  = 1}  if c(0)  = 1 , and 
rVLKOfo)  I c(2,!3,+1)  = 1})  if  c(0)  > 1 . 


c is  an  open  code  if  c(0)  = 0,  c is  a closed  code  if  c(0)  = 1,  and  c a Gj  code 
if  c(0)  > 1.  For  every  open,  closed,  or  Gs  set  X there  is  a respective  code 
c such  that  X — Ac.  All  the  Borel  sets  can  be  coded  as  in  Solovay  [70],  but 
that  complexity  is  not  necessary  because  of  the  approximation  result  0.9.  While 
the  extensions  of  these  sets  may  vary  from  model  to  model,  their  Boolean  and 
measure-theoretic  essence  can  be  described  absolutely  in  terms  of  their  codes. 


11.8  Lemma.  Suppose  that  M be  a transitive  e-model  o/ZF.  Then: 

(a)  The  following  are  absolute  for  M: 

x e Ac  , Ac  0 , Ac  ^ Arf  , Ac  C ( &>  A^)  , and  Ac  n A^  — Ae. 

(That  is,  if  x,  c e M and  c is  a code,  then  x e Af  iff  x e Ac;  and  so  forth.) 

(b)  If  c e M is  a code,  then  m’^iA’f)  = mi(Ac). 

Proof,  (a)  This  is  straightforward  since  the  codes  determine  the  interplay  of  the 
<9(S/)’s,  which  in  turn  only  depends  on  the  s,’s. 

(b)  Suppose  first  that  c is  an  open  code.  A sequence  {ij  \ j e co)  can  be 
defined  from  c such  that  Ac  is  a disjoint  union  of  the  0(s,.)’s,  as  in  the  proof  of 
0.8(d).  Thus,  mL(Ac ) is  determined  absolutely. 

For  a closed  code  c,  let  d be  an  open  code  such  that  A^  = wa>  — Ac  and  use 
(a). 

Finally,  suppose  that  c is  a Gs  code.  A sequence  (c„  | new)  of  open 
codes  can  be  defined  from  c such  that  ACm  3 ACn  if  m < n and  Ac  = fj(I  ACii  . 
mL(Ac)  = inf ({mL(AcJ  \ n e co}),  and  thus  is  determined  absolutely.  H 

Forcing  with  B*  can  now  be  characterized  in  terms  of  adding  a real: 


11.9  Theorem  (Solovay  [65b,  70]).  Suppose  that  G is  B* -generic.  Then  there  is 
a unique  x e m co  such  that  for  any  closed  code  c e V, 

x e Aj^l  iff  Avc  e G , 


and  V[x ] = V[G], 

Proof.  The  “C  part”  of  0.9(a)  has  the  following  consequences: 
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(i)  For  any  n e co, 

{C  e B*  | C is  closed  A 3k(C  c {/  e “co  \ f(n)  = k})} 
is  dense  in  B* . 

(ii)  For  any  Lebesgue  measurable  A c.“ao  with  A e V, 

{C  e B*  | C is  closed  A (CU  v C D A = 0)} 
is  dense  in  B* . 

For  (ii),  suppose  that  p e B*  is  arbitrary.  If  mL(p  fl  A)  > 0,  there  is  a closed 
C e B*  such  that  C c p n A.  Otherwise  mL(p  — A)  > 0 and  there  is  a closed 
C e B*  such  that  C c p — A. 

Arguing  in  V[G]  there  is  a unique  x e “co  specified  by 

[x]  = f|{Aj[cl  | c e V is  a closed  code  A Avc  e G } 

since  this  is  an  intersection  of  closed  sets  with  the  finite  intersection  property  and 
(i)  holds. 

Suppose  now  that  c e V is  an  arbitrary  closed  code.  If  A e G,  then 
x e by  definition.  Conversely,  suppose  that  x e To  show  that 

e G,  it  suffices  by  (ii)  to  show  that  for  any  closed  code  d e V with  Aj  e 
G,  Avc  n Aj  ^ 0.  But  this  follows  from  x e A^[G]  n Aj[G]  and  11.8(a). 

Finally,  G is  definable  in  V[x]  by 

G = {peB*  | 3c(c  e V is  a closed  code  Axe  Aj[x]  a Afv[x]  c p)}  . 
Flence,  V[x]  = V[G].  H 

Solovay  called  such  an  x a random  real,  more  precisely,  for  M a transitive 
e-model  of  ZFC  and  x e 10 co, 

x is  random  over  M iff  x is  as  in  11.9  for  some  (B*)M -generic  G over  M . 

The  next  result  is  an  informative  characterization  of  random  reals  in  terms 
of  codes,  and  Solovay  [70: 33]  observed  how  it  reflects  the  intuitive  concept  of 
randomness.  Ironically,  as  most  of  the  generic  reals  were  eventually  named  after 
their  discoverers,  Solovay  squandered  an  opportunity  by  his  choice  of  le  mot  juste ! 
In  what  follows,  being  null  is  unambiguous  by  1 1.8(b),  and  1 1.8(a)  is  used  without 
further  mention. 

11.10  Theorem  (Solovay  [65b,  70]).  Suppose  that  M is  a transitive  e-model  of 
ZFC.  Then  x € “co  is  random  over  M iff  x Ac  for  any  c e M which  is  a G$ 
code  for  a null  set. 

Proof  Suppose  first  that  x is  random  over  M with  G the  corresponding  (B*)M- 
generic  over  M,  and  c e M is  a Gs  code  for  a null  set.  Then 

M |=  = [C  e B*  | C is  closed  A C c “co  — Ac]  is  dense  in  B . 
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Hence,  there  is  a closed  code  d e M such  that  Af  e G fl  D and  so  x e A^[c] . 
But  then,  x e Aj  and  so  x £ Ac. 

Conversely,  suppose  that  x <f  Ac  whenever  c e M is  a G$  code  for  a null  set. 
Considering  how  x can  be  used  to  define  a corresponding  G c (B*)M  as  in  the 
proof  of  1 1.9,  to  show  that  x is  random  over  M it  suffices  to  show  that:  whenever 
D e M is  dense  in  (J3*)M , there  is  a closed  code  c e M and  a p e D such  that 
A y c p and  x e Ac. 

To  do  this,  let  D e M be  dense  and  first  argue  in  M:  Let  W be  a maximal 
antichain  satisfying  W c {C  \ C is  closed  A 3p  e D(C  c p)}.  W is  countable 
by  the  cui-c.c.,  so  using  closed  codes  let  {ACn  | n e to)  enumerate  it.  Now  let 
c ewbo  be  any  function  satisfying  c(0)  > 1 and 

c(2"3i+1)  = 1 iff  0(Si)  n ACn  = 0 . 

Since  each  ACn  is  closed,  c is  a Gs  code  for  — ACn),  a null  set  by  the 

maximality  of  W. 

Stepping  out  of  M,  by  hypothesis  x £ Ac.  Hence,  x e ACn  for  some  cn  e M 
to  complete  the  proof.  H 

Random  reals  enter  the  coming  consistency  results  through  this  characteri- 
zation. Beyond  this  initial  use  they  have  found  a variety  of  applications  in  set 
theory. 

Proof  of  Solovay’s  Theorem 

The  main  theorem  11.1  is  proved  by  first  establishing  an  auxiliary  result  about 
the  Levy  collapse  and  then  passing  to  an  inner  model.  This  result  is  itself  quite 
significant  in  that  it  asserts  in  a ZFC  context  that  a large  collection  of  sets  of  reals 
is  well-behaved. 

A set  X is  "On -definable  iff  for  some  a e "On  and  formula  cp( V\,  V2 ), 
y e X iff  cp[a,  y]  . 

Of  course,  using  one  parameter  in  "On  is  the  same  as  using  finitely  many  from 
On  U "On  by  coding.  Like  the  well-known  concept  of  ordinal  definability  (see 
Kunen  [80:  V§2])  this  concept  is  formally  definable  within  set  theory  through  the 
Reflection  Principle  for  ZF:  X is  "On-definable  iff 

3 a3a3(p(a  e "Onfl  Va  A Vy(y  e X **  y e Va  A Va  \=  cp[a,  v]))  . 

11.11  Theorem  (Solovay  [65b,  70]).  Suppose  that  k is  an  inaccessible  cardinal 
and  G is  Col(o),  K)-generic.  Then  in  V[G],  every  "On- definable  set  of  reals  is 
Lebesgue  measurable,  has  the  Baire  property,  and  has  the  perfect  set  property. 

The  proof  hinges  on  a key  lemma  that  uses  previous  facts  developed  about  the 
Levy  collapse  Col(&>,  k)\ 
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11.12  Lemma.  For  each  formula  (p(v\)  there  is  a corresponding  formula  tp(v\) 
such  that:  for  any  x e "On  fl  V [G], 

V\G]  \=  <p[x]  iff  V[X]  \=  v[x]  . 

Proof.  Set  P — Col(&>,  k).  For  any  x e "Onfl  V[G],  by  10.21  there  is  a P-generic 
H over  V[x]  such  that  V[G]  — V[x][H],  However,  by  the  weak  homogeneity  of 
P (10.19)  it  is  not  necessary  to  know  H to  decide  whether  or  not  < p\x~\  is  satisfied: 

V[x][H]  (=  cp[x]  iff  V[x]  (=  r|b/>  ^a)1  • 

Hence,  <p  can  be  taken  to  be  a formula  in  one  free  variable  corresponding  to  this 
last  forcing  assertion  about  x and  P.  H 

Proof  of  11.11.  Argue  in  V[G].  Note  first  that 

for  any  a e "On,  "<u  fl  V[a\  is  countable 

since  k is  still  inaccessible  in  V[a]  by  10.12  and  10.17(e). 

Suppose  now  that  A c wco  is  "On-definable,  so  that  for  some  a e "On  and 
<P(Vi,  v2), 

x e A iff  (p\a , x]  . 

Using  a two-variable  version  of  11.12  there  is  a formula  <p(v\,  v2)  such  that 

x e A iff  U[fl][x]  1=  $[a,x]  . 

We  first  verify  A’s  Lebesgue  measurability:  Note  that 

{x  e "®  | .v  is  not  random  over  V[a]} 

is  null,  being  equal  by  11.10  to 

(jMc  I c e V[a~\  is  a Gs  code  for  a null  set)  , 

a countable  union  of  null  sets  as  "coO  V[a\  is  countable.  Thus,  it  suffices  to  find 
a Borel  set  X such  that  A AX  consists  of  reals  not  random  over  V/[«]: 

For  B*  forcing  in  the  sense  of  V[a]  with  r a canonical  name  for  the  random 
real,  let  Y e V[a]  be  a maximal  antichain  consisting  of  closed  sets  each  deciding 
<p(a,  r),  all  in  the  sense  of  V[a].  If  x is  random  over  V[a],  by  genericity 

x e A iff  xe  lJ{Ae  | Acv[a]  e Y a Acv[fl]  |h  f(d,  r)}  . 

Since  Y is  countable  by  the  co\ -c.c.,  the  set  on  the  right  is  a Borel  (in  fact  an  Fa) 
set,  and  hence  can  serve  as  our  desired  set  X. 

The  verification  of  the  Baire  property  is  only  outlined  since  it  parallels  the 
Lebesgue  measurability  argument.  (As  mentioned  before,  Shelah  [84]  has  shown 
that  an  inaccessible  cardinal  is  not  necessary  to  get  the  consistency  of  every  set 
of  reals  having  the  Baire  property.  ) The  idea  is  to  force  instead  with 


1 1 . Lebesgue  Measurability 


141 


£>'  = {X  e B | X is  not  meager}  , 

ordered  by  inclusion.  The  separative  quotient  this  time  has  a countable  dense 
subset,  namely  the  equivalence  classes  of  the  0(s)’s,  and  so  by  10.20  forcing 
with  £>'  coincides  with  forcing  with  Col(&>,  {«}),  i.e.  adding  a Cohen  real.  Using 
0.12(b)  and  introducing  codes  for  the  Fa$  sets  (countable  intersections  of  Fa  sets), 
analogues  of  the  characterizations  11.9  and  11.10  for  Cohen  reals  can  be  estab- 
lished with  Gs  replacing  closed  and  F„$  replacing  G$.  The  rest  of  the  argument 
goes  through,  modulated  only  by  these  changes. 

Finally,  we  verify  the  perfect  set  property  for  A.  It  can  be  assumed  that  A is 
uncountable,  so  that  because  “a>r\V[a]  is  countable,  there  must  be  a y e A — V\a\. 
By  10.21  there  is  a Col(a>,  Ar)-generic  //  over  V[«]  such  that  V[a][Fl]  — V[G ] 
and  so  by  10.17(e)  there  is  a p.o.  Q e V[a]  satisfying  Q c Col(<w,  k)  and  \ Q\  < k 
such  that  y e V\a~\[H  fl  Q\.  Temporarily  let  ||-  denote  the  forcing  relation  for  Q 
in  V[a~\.  There  is  a condition  p and  a name  y such  that 

P lh  y e a(o  -V[a]  A V [a][.v]  h &(a,  y)  . 

Also,  Vv^‘HQ)  is  countable  in  V[G]  as  k is  inaccessible  in  V[a],  so  there  is  an 
enumeration  ( Dn  \ new)  of  the  dense  subsets  of  Q in  V[a].  To  each  t e <0>2 
can  now  be  associated  a p,  e Q and  a u,  e <w2  as  follows: 

Let  pf,  < p be  such  that  p 0 e Dq.  Given  p,  < p with  |r|  = n,  define 
u t , pr~o,  and  pt~\  as  follows:  Because  p ||-  y <£.  V [a]  there  must  be  a least  kern 
such  that  p,  does  not  decide  a value  for  y(k).  Let  u,  satisfy  p,  ||-  y\k  = ut,  and 
choose  pt'o.  pr  \ both  in  Dn+ 1 so  that  p,- 0 and  p,~\  decide  different  values  for 
y(k). 

For  each  x e “2,  set 

Gx  = {q  e Q I 3 t(p,  <q  a fCj)}. 

By  the  construction  each  Gx  is  g-generic  over  V[a],  Next,  let 

C = {yG*  | .r  e M2j  . 

Then  CCA  since  p e Gx  for  every  x e "'2.  Finally,  C is  perfect,  since  it 
contains  no  isolated  points  and 

C = n„U{0(Kr)  I 

where  each  finite  union  is  closed  by  0.8(c)  so  that  overall  intersection  is  closed. 
This  completes  the  proof  of  1 1 . 1 1 . H 

Solovay  established  the  main  result  11.1  by  passing  in  V[G ] to  the  inner 
model  arising  naturally  from  the  previous  proof,  the  class  of  hereditarily  "On- 
definable  sets,  i.e.  those  sets  .r  such  that  every  member  of  the  transitive  closure 
tc({x})  is  "On-definable.  Flere,  we  shall  pass  to  the  constructible  closure  (§3)  of 
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the  reals,  L(R).  This  is  the  standard  name  for  the  model,  and  R was  taken  in  §0 
to  be  the  Dedekind  completion  of  the  rationals,  but  in  any  case 

L{  R)  = LCco)  = L(V(co)) 

by  interdefinability.  Although  L(R)  is  in  general  a smaller  model  than  Solovay’s, 
it  can  be  shown  that  they  are  the  same  had  L been  the  ground  model  for  the  initial 
Levy  collapse.  Moreover,  L(R)  has  figured  prominently  in  the  investigation  of 
strong  hypotheses  (§§27,30).  The  verification  of  DC  is  less  immediate  than  for 
Solovay’s  original  inner  model,  but  is  of  intrinsic  interest. 

Proof  of  11.1.  Building  on  11.11,  continue  to  work  in  V[G],  "u>nL(R)  = a,co,  so 
through  codes  every  open,  closed,  Gs  and  hence  F„  set  is  in  L(R)  and  satisfies 
these  respective  properties  in  L( R).  (In  fact,  every  Borel  set  is  in  L(R).) 

Suppose  now  that  A C with  A e L( R).  Then  A is  "On-definable,  and 
so  by  the  proof  of  11.11  there  is  an  Fa  set  X such  that  A A A is  null.  It  follows 

(by  0.9(a))  that  for  each  n e co  there  is  an  open  set  On  I)  A AX  such  that 

mL(On ) < — j.  By  the  previous  paragraph  and  absoluteness  (11.8),  all  this  holds 
in  L(R)  as  well,  and  so  (A  is  Lebesgue  measurable) L(R).  The  argument  for  the 
Baire  property  is  analogous. 

For  the  perfect  set  property,  it  can  be  assumed  in  addition  that  A is  uncount- 
able in  L(R).  Then  A is  (really)  uncountable,  since  every  countable  well-ordering 
is  coded  by  a real  and  hence  in  L(R).  Thus,  A has  a perfect  subset  C by  11.11, 

and  it  is  simple  to  see  that  C is  perfect  in  the  sense  of  L(R). 

The  following  proposition  completes  the  proof  of  11.1. 

11.13  Proposition  (ZF  + DC).  (DQL(R). 

Proof  First  note  that  by  straightforward  means  as  for  the  definition  of  a well- 
ordering of  L,  a surjection  <P:  On  x R ->  L(R)  can  be  defined  by  transfinite 
recursion  so  that  for  any  a,  0\  (a  x R)  e L(R). 

Suppose  now  that  X,  R e L(R),  X f 0,  R c X x X,  and 

V.r  e X3y  e X({x,y)  e R ) . 

Arguing  in  V,  by  DC  there  is  an  f ewX  such  that  (/(«),  f(n  + 1))  e R for  every 
n e co.  Let  S be  such  that  ran  (/)  c <p“(<5  x R),  and  using  AC  a,,  a consequence 
of  DC,  let  ( an  \ n e co)  be  such  that  for  every  n e co  there  is  a § <5  such  that 
<£((;,  a„)  = f(n).  Now  define  a relation  -<  on  8 x co  by: 

iff  i = j + 1 A (<£(£,  cij),  a,-))  e R . 

0\(S  x R)  e L(R),  and  the  sequence  of  reals  ( an  \ n e co)  e L(R)  since  it 
can  be  coded  by  one  real.  Hence,  -<  e L(R).  Now  -<  is  ill-founded  in  L( R) 
because  of  ran (/)  by  absoluteness  (0.3).  Let  E c 8 x co  with  E e L(R)  be 
non-empty  and  without  a ^-minimal  element.  Then  starting  with  any  ( rj , if  e 
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E a sequence  ( rj „ \ n e co)  can  be  defined  by  recursion  so  that  rj0  — rj  and 
{<P(r)n,aio+„),  0(r]n+i,aio+n+ 1)>  e R for  every  n e co,  specifying  ijn+ 1 to  be  the 
minimal  possible  given  r]„.  By  definability  (<P(?;„,  a,0+„)  | n e co)  e L(M),  and 
the  proof  is  complete.  H 

Having  finally  completed  the  proof  of  11.1,  this  section  is  concluded  with  a 
compact  discussion  of  one  further  problem  resolved  in  Solovay’s  model.  An  early 
observation  in  the  study  of  partition  relations  was  that  if  there  is  a well-ordering 
of  the  reals,  then  co  — > (co)®  fails  (7.1).  By  devising  a new  generic  real,  Mathias 
[68,77]  established  that  this  partition  property  holds  in  Solovay’s  model. 

The  Mathias  forcing  p.o.  is 

p = [a>]<<0  x [®r 

ordered  by: 

(s,  A)  < (t.  B ) iff  t is  an  initial  segment  of  s A A U (s  — t)  C B . 

The  idea  is  that  a condition  ( s , A)  determines  an  initial  segment  of  a new  subset 
of  co,  whose  further  members  are  to  be  restricted  to  A.  Such  a forcing  was  first 
formulated  by  Prikry  for  measurable  cardinals  (§18).  Note  that  if  (s,  A)  ||  (. t , B), 
then  A fl  B is  infinite  and  s is  an  initial  segment  of  t or  vice  versa.  Thus,  if  G is 
P-generic,  then  G determines  a subset  of  co, 

xc  = L>  | 3A((s,  A)  e G)}  . 

xq  in  turn  determines  G,  since  if 

G'  = {(s.  A)  e P | s is  an  initial  segment  of  xq  and  xc  — s c A}  , 

then  G — G'\  It  is  simple  to  check  that  G C G'  and  G'  is  P-generic.  But  then 
G — G'  on  general  grounds:  For  any  (, s , A)  e P, 

{(t,  B)  e P | (t,  B)  < {s,  A)  v (t,  B)L{s,  A)} 

is  dense  in  P,  and  so  ( s , A)  e G'  readily  implies  that  (s,  A)  e G. 

Thus,  V[xc ] — V[G].  Such  an  xc  is  called  a Mathias  real,  more  precisely, 
for  M a transitive  e-model  of  ZFC  and  x c co, 

x is  a Mathias  real  over  M iff  x = xq  for  some  PM-generic  G over  M . 

(Here,  it  is  more  convenient  to  consider  reals  as  subsets  of  co.)  P does  not  have  the 
coi-c.c.,  but  it  preserves  co\  and  has  other  useful  features  that  have  led  to  several 
applications.  Mathias  himself  established  some  crucial  properties: 

11.14  Proposition  (Mathias  [77:  86]). 

(a)  For  any  {s,  A)  e P and  formula  (p  in  the  forcing  language,  there  is  a 
B e [AY  such  that  { s , B)  ||  <p. 

(b)  If  x is  a Mathias  real  over  M,  then  any  y e [x]w  is  also  a Mathias  real 

over  M.  H 
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The  point  of  (a)  is  that  the  ,v  need  not  be  extended  to  decide  formulas;  simpler 
proofs  for  analogous  results  for  measurable  cardinals  will  be  provided  later  (§18). 

For  discussing  versions  of  co  — > (co)®  depending  on  the  complexity  of 
partitions, 

Y c [co]w  is  Ramsey  iff  3x  e c Tv  [*]"  n Y = 0)  . 

Thus,  co  — > (&>)“  iff  eveiy  Y C [co]10  is  Ramsey.  See  the  end  of  §27  for  more 
about  this  property. 

11.15  Exercise  (Mathias  [68,77]  ).  Suppose  that  there  is  an  inaccessible  cardinal 
k and  G is  Col(&>,  k)- generic.  Then 

(a)  In  V[G],  every  “ On-definable  subset  of  [w]“  is  Ramsey. 

(b)  In  L(M)y[cd,  co  — » ( 00)2  holds. 

Hint.  For  (a),  let  Y C satisfy 

x e Y iff  V[a][x]  H 

with  a e wa>.  By  11.14(a),  there  is  an  (0,  A)  e such  that  (0,  A)  ||  <p(a , r), 
where  r is  a canonical  name  for  the  Mathias  real.  Now  note  that  there  is  a Mathias 
real  .r  over  V[a]  satisfying  x C.  A,  and  conclude  by  1 1.14(b)  that  such  anx  verifies 
that  Y is  Ramsey.  H 

Unlike  the  other  properties  considered,  it  is  not  known  whether  an  inaccessible 
cardinal  is  necessary  for  co  — >-  (w)® : 

11.16  Question.  Does  Con(ZFC)  imply  Con(ZF  + DC  + co  — * («)")? 

Solovay’s  fruitful  ideas,  first  developed  to  establish  the  global  result  11.1, 
were  to  lead  to  important  hierarchical  extensions  of  classical  results  in  descriptive 
set  theory  (§14). 
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12.  Descriptive  Set  Theory 

Descriptive  set  theory  is  the  definability  theory  of  the  continuum,  the  study  of  the 
structural  properties  of  definable  sets  of  reals.  Motivated  initially  by  constructivist 
concerns,  a major  incentive  for  the  subject  has  been  to  investigate  the  extent  of 
the  regularity  properties,  properties  indicative  of  well-behaved  sets  of  reals  of 
which  Lebesgue  measurability,  the  Baire  property,  and  the  perfect  set  property 
are  the  prominent  examples.  The  subject  developed  progressively  from  Suslin’s 
results  on  analytic  sets  in  1916,  until  Godel  established  delimitative  results  showing 
under  V — L that  there  are  simply  defined  sets  of  reals  that  do  not  have  the 
regularity  properties.  In  the  ensuing  years  Kleene  developed  what  turned  out 
to  be  the  effective  version  of  the  theory  as  a generalization  of  his  foundational 
work  in  recursion  theory,  and  considerably  refined  the  classical  results.  Then  in 
1965  Solovay  reactivated  the  classical  program  by  building  on  his  11.1  to  provide 
characterizations  for  the  regularity  properties  at  the  level  of  GddeTs  V — L 
delimitation.  While  11.1  had  established  the  relative  consistency  of  every  set  of 
reals  having  the  regularity  properties,  Solovay’s  further  results  began  the  use  of 
large  cardinal  hypotheses  to  derive  directly  the  regularity  properties  for  sets  of 
reals,  an  approach  that  eventually  met  with  remarkable  success  by  the  late  1980’s 
as  investigations  with  forcing  and  inner  models  established  unexpectedly  close 
connections  between  large  cardinal  hypotheses  and  definable  sets  of  reals. 

This  and  the  next  section  serve  as  a basis  for  succeeding  sections  by  reviewing 
the  historical  development  and  the  basic  theory,  and  §§14, 15  discuss  work  of 
Solovay  and  then  Martin  incorporating  large  cardinal  hypotheses.  Moschovakis 
[80]  provides  a comprehensive  treatment  of  descriptive  set  theory  in  a general 
context,  Rogers  [80]  a compendium  of  recent  results  on  analytic  sets,  Kechris  [95] 
an  exposition  of  the  classical  theory  in  light  of  recent  work,  and  Hinman  [78]  an 
approach  emphasizing  hierarchies.  Mansfield- Weitkamp  [85]  and  Jech  [03:  §25] 
are  shorter  accounts  that  incorporate  modem  developments  using  forcing.  This 
section  begins  by  recounting  the  early  junctures  of  the  subject,  deferring  proofs  to 
the  next  section. 

Descriptive  set  theory  has  its  origins  in  the  work  of  the  French  analysts  Borel, 
Baire,  and  Lebesgue  at  the  turn  of  the  century.  Following  important  contributions 
to  the  theory  of  integration  by  Cantor  as  part  of  his  study  of  sets  (see  Flawkins 
[75:  3.3]),  Borel  [98:46-47]  isolated  the  Borel  sets  through  an  axiomatically  pre- 
sented measure.  Taking  a concrete  approach  to  the  general  concept  of  function, 
Baire  in  his  thesis  [99]  distinguished  and  provided  a hierarchical  analysis  of  the 
Baire  functions,  those  functions  in  the  smallest  collection  containing  the  continuous 
functions  of  several  real  variables  and  closed  under  the  taking  of  pointwise  limits. 
While  Lebesgue’s  thesis  [02]  is  fundamental  for  measure  theory  as  the  source  of  his 
concept  of  measurability,  his  first  major  work  in  a distinctive  direction  was  to  be 
the  seminal  paper  of  descriptive  set  theory:  In  the  memoir  [05]  Lebesgue  investi- 
gated the  Baire  functions,  stressing  that  they  are  exactly  the  functions  definable  via 
analytic  expressions  (in  a sense  made  precise).  He  correlated  the  Baire  functions 
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with  the  Borel  sets  by  showing  that  the  latter  are  exactly  the  pre-images  of  open 
sets  by  the  former.  With  this  he  had  introduced  the  first  hierarchy  for  the  Borel 
sets,  differing  in  minor  details  from  the  now  standard  one  from  Hausdorff  [14], 
In  §0  :"o>  served  as  the  reals  for  formulations  and  the  extension  of  concepts 
to  k(f°co)  for  k e co  was  assumed  through  homeomorphisms.  It  is  now  incumbent 
to  state  our  further  definitions  explicitly  for  k(wco),  with  k implicitly  ranging  over 
co.  The  classes  £®  and  II®  for  0 < a < of  the  Borel  hierarchy  are  defined  by 
stipulating  membership  for  each  A C k(a>co ) as  follows: 

A e iff  A is  open  in  k(“co)  , and 

A e IT®  iff  A is  closed  in  k (“co)  , 


and  recursively  for  a > 1, 

iff  A is  the  union  of  a countable  subset  of  Uo<^<a  ^ , and 
Aen°a  iff  A is  the  intersection  of  a countable  subset  of  [_J0<^<a,  £/?  • 

Each  L®  fl  V(k(“co))  is  closed  under  the  taking  of  countable  unions,  and  each 
II®  fl7 D(k(a>co)),  the  taking  of  countable  intersections.  It  is  readily  seen  by  induction 
that  £®  and  IT®  are  dual  collections,  i.e.  A e L®  iff  k(“co)-A  e II®  for  A c k(“co). 
Moreover,  L®  C £®  by  0.8(c)(d),  and  so  by  induction  0 < f < a implies  that 
Sjunjcsjjn  II® . By  a cofinality  argument  the  subsets  of  k(“co)  in 
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comprise  a cr -algebra,  and  hence  are  the  Borel  subsets  of  k(“co).  In  §§0,11 
Hausdorff  s terms  Fa  and  Gs  were  used  for  the  subsets  of  “co  in  £®  and  11® 
respectively;  the  now  standard  L®  and  II®  notation  is  from  Addison  [58]. 

The  main  results  of  Lebesgue  [05]  are  that  this  is  a proper  hierarchy  since 
he  showed  in  effect  that  L®  £ II®  for  any  countable  a > 0,  and  that  there  is  a 
Lebesgue  measurable  set  which  is  not  a Borel  set.  In  fact,  this  paper  is  remarkable 
for  anticipating  central  concerns  and  techniques  of  mathematical  logic:  the  sys- 
tematic study  of  definable  sets  and  the  use  of  Cantor’s  universal  enumeration  and 
diagonalization  argument  to  effect  transcendence.  Although  Lebesgue’s  reserva- 
tions about  arbitrary  functions  had  turned  to  open  skepticism  with  the  appearance 
of  Zermelo’s  Well-Ordering  Theorem  [04],  it  is  ironic  that  his  paper  contained 
tacit  but  essential  uses  of  the  Axiom  of  Choice  (see  Moore  [82:2.3]). 

Descriptive  set  theory  emerged  as  a distinct  discipline  through  the  initiatives 
of  the  Russian  mathematician  Nikolai  Luzin.  Through  a focal  seminar  that  he 
began  in  1914  at  the  University  of  Moscow,  he  was  to  establish  a prominent 
school  in  the  theory  of  functions  of  a real  variable  (see  Phillips  [78];  Keldysh 
[74],  Uspenskii  [85],  and  Kanovei  [85]  survey  the  contributions  to  descriptive  set 
theory).  Luzin  had  become  acquainted  with  the  work  and  views  of  the  French 
analysts  while  he  was  in  Paris  as  a student,  and  from  the  beginning  a major  topic 
of  his  seminar  was  the  “descriptive  theory  of  functions”.  Significantly,  the  young 
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Sierpinski  was  an  early  participant;  he  had  been  interned  in  Moscow  in  1915,  and 
Luzin  and  his  teacher  Dmitry  Egorov  interceded  on  his  behalf  to  let  him  live  freely 
until  his  repatriation  to  Poland  a year  later.  Not  only  did  this  lead  to  a decade- 
long  collaboration  between  Luzin  and  Sierpinski,  but  it  undoubtedly  encouraged 
the  latter  in  his  founding  of  the  Polish  school  (§2)  and  laid  the  basis  for  its  interest 
in  descriptive  set  theory. 

Luzin  described  the  cardinality  problem  for  Borel  sets  (operatively  becoming 
whether  they  have  the  perfect  set  property)  to  Pavel  Aleksandrov,  an  early  member 
of  the  seminar  and  later  a pioneer  of  modem  topology.  That  the  Borel  sets  are 
Lebesgue  measurable  and  have  the  Baire  property  are  intrinsic  to  these  concepts, 
but  little  was  known  about  the  perfect  set  property  beyond  the  Cantor-Bendixson 
result  11.3(a).  The  property  was  implicit  in  a scheme  of  William  Young  [03] 
clarified  by  Hausdorff  [14: 465ff],  who  used  it  to  show  that  every  L®  set  is  either 
countable  or  has  cardinality  2K°.  Hausdorff  [16]  then  extended  this  conclusion  to 
all  Borel  sets.  Aleksandrov  [16]  was  able  to  establish  that  the  Borel  sets  have  the 
perfect  set  property.  The  proof  required  a new  way  of  comprehending  the  Borel 
sets,  as  it  turns  out  that  a set  having  the  perfect  set  property  does  not  imply  that 
its  complement  does,  and  so  an  inductive  proof  through  the  Borel  hierarchy  is 
not  possible.  The  new,  more  direct  analysis  of  Borel  sets  broke  the  ground  for  a 
dramatic  development: 

Soon  afterwards  another  student  of  Luzin’s,  Mikhail  Suslin  (often  rendered 
“Souslin”  in  the  French  transliteration),  began  reading  Lebesgue  [05].  Memoirs 
of  Sierpinski  [50: 28ff]  recalled  how  Suslin  then  made  a crucial  discovery.  For 
any  set  B,  the  projection  of  B is 

pB  = [x  | 3 y({x,  y)  e B)}  . 

In  particular,  for  B c i+1(®o>)  = *(®<y)  x wco, 

pB  = { (x  i , *..,xk)  | 3y((xi ,xk,  y)  e B)}  . 

Suslin  noticed  that  at  one  point  Lebesgue  asserted  ([05: 191-192])  that  the  pro- 
jection of  a Borel  subset  of  the  plane  (2  ('"&>)  in  our  formulation)  is  also  a Borel 
set.  This  was  based  on  the  mistaken  claim  that  given  a countable  collection  of 
subsets  of  the  plane  the  projection  of  their  intersection  is  equal  to  the  intersection 
of  their  projections  - in  terms  of  quantifiers,  an  unjustifiable  V3  to  3V  switch. 
Suslin  soon  found  a counterexample  to  Lebesgue’s  assertion,  and  this  led  to  his 
inspired  investigation  of  the  analytic  sets.  For  A C k(a>co), 

A is  analytic  iff  A — pB  for  some  closed  subset  B of  <r+1(M®)  . 

(To  proceed  classically  with  M instead  of  “closed”  here  must  be  replaced  by 
“Gs”,  i.e.  countable  intersections  of  open  subsets  of  R.)  Suslin  [17]  first  formulated 
the  analytic  sets  as  the  A-sets  (les  ensembles  (A)),  sets  resulting  from  an  explicit 
operation,  the  Operation  (A),  and  then  characterized  them  as  projections.  He  an- 
nounced three  main  results,  that  every  Borel  set  is  analytic,  that  there  is  an  analytic 
set  which  is  not  Borel , and  the  following  characterization  (established  by  13.5): 
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12.1  Theorem  (Suslin  [17]).  For  A C k(wa>),  A is  Borel  iff  A and  k (0Ja>)  — A are 
both  analytic. 

Thus,  the  Borel  sets  can  be  characterized  in  simple  terms  from  above,  as 
well  as  analyzed  through  a hierarchy  from  below.  Paradigmatic  for  later  hierarchy 
results,  this  theorem  really  began  the  subject  of  descriptive  set  theory.  [17]  was  to 
be  Suslin’ s sole  publication,  for  he  succumbed  to  typhus  in  a Moscow  epidemic  in 
1919  at  the  age  of  25.  (The  whole  episode  recalls  a pivotal  mistake  by  Augustin- 
Louis  Cauchy  and  the  clarification  due  to  the  young  Niels  Abel  that  led  to  the 
concept  of  uniform  convergence,  even  to  an  unjustifiable  V3  to  3V  switch  and 
Abel’s  untimely  death.) 

In  an  accompanying  note,  Luzin  announced  the  regularity  properties  for  an- 
alytic sets  ((a)  and  (b)  are  established  by  14.4  and  (c)  by  14.8): 

12.2  Theorem  (Luzin  [17];  (c)  is  attributed  there  to  Suslin). 

(a)  Every  analytic  set  is  Lebesgue  measurable. 

(b)  Every  analytic  set  has  the  Baire  property. 

(c)  Every  analytic  subset  of  cocl>  has  the  perfect  set  property. 

(The  attribution  of  (c)  to  Suslin  is  actually  a faint  echo  of  a question  of 
priority.  According  to  memoirs  of  Aleksandrov  [79:  284-286]  it  was  he  who  had 
defined  the  A-sets,  and  Suslin  proposed  the  name,  as  well  as  “Operation  (A)” 
for  the  corresponding  operation,  in  Aleksandrov’s  honor.  This  eponymy  is  not 
mentioned  in  Suslin  [17]  but  is  supported  by  recollections  of  e.g.  Keldysh  [74:  180] 
and  Kuratowski  [80:  69].  Aleksandrov  recalled  that  he  had  shown  that  every  Borel 
set  is  an  A-set  and  that  every  A-set  has  the  perfect  set  property,  although  this  is 
not  explicit  in  his  [16],  He  then  tried  hard  in  1916  to  show  that  every  A-set  is 
Borel,  only  discontinuing  when  it  became  known  that  in  the  summer  Suslin  had 
found  a non-Borel  A-set.  According  to  Aleksandrov:  “Many  years  later  Luzin 
started  to  call  A-sets  analytic  sets  and  began,  contrary  to  the  facts,  which  he  knew 
well,  to  assert  that  the  term  ‘A-set’  is  only  an  abbreviation  for  ‘analytic  set’.  But 
by  this  time  my  personal  relations  with  Luzin,  at  one  time  close  and  sincere,  were 
estranged.”  Luzin  [25,27]  did  go  to  some  pains  to  trace  the  term  analytic  back 
to  Lebesgue  [05]  and  pointed  out  that  the  original  example  there  of  a non-Borel 
Lebesgue  measurable  set  is  in  fact  the  first  example  of  a non-Borel  analytic  set. 
See  also  the  text  Luzin  [30:  186ff],  in  which  the  Operation  (A)  is  conspicuous  by 
its  absence.) 

The  notes  Suslin  [17]  and  Luzin  [17]  were  considerably  elaborated  in  the 
ensuing  years,  as  the  investigation  of  analytic  sets  was  continued  by  Luzin  and 
his  circle  in  Moscow  and  by  Sierpinski  in  Warsaw.  With  the  Borel  sets  having 
been  properly  extended  through  the  new  operation  of  projection,  Luzin  [25a]  and 
Sierpinski  [25]  formulated  the  projective  sets  as  those  sets  obtainable  from  the 
Borel  sets  by  the  iterated  application  of  projection  and  complementation.  There 
is  the  corresponding  projective  hierarchy,  formulated  in  modem  notation  for  coa>: 

For  A C ktyco). 
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AelJ  iff  A e 2®,  i.e.  A is  open  , 
and  recursively  for  n e a>, 

A e IT'  iff  kCoj)  - A e 21  , and 
A e 2*+1  iff  A = pB  for  some  TT,1,  subset  B of  k+ 1 (wol>)  . 

Thus,  A e 2}  iff  A is  analytic.  Also  define 

K = K n K . 

These  terms  are  also  used  as  adjectives,  as  in  “21  set”  or  “A  is  II1  2 1 and  II1 
are  dual  collections,  i.e.  A e 21  iff  k(wa>)  — A e IT,1,  for  A c k(tuco).  Moreover, 
2*  U II,1,  C A'+1.  This  follows  readily  by  induction  from  IIq  c nj,  which  itself 
is  seen  as  follows: 

Suppose  that  A e IlJ,  i.e.  A is  closed,  so  that  A = P|„€(U  G„  for  open  sets 
G„  (by  0.8(c)(d)).  Set  G — Uneo)  Gn  x 0((n >)  C 2(wa>).  Then  G is  open,  and 
for  any  x e wa>, 

x e A -o-  Vn(x  e G„) 

Vv((x,y>  e G) 

so  that  A e n}. 

Suslin’s  result  12.1  is  the  assertion  that  Aj  is  exactly  the  collection  of  Borel 
sets.  Luzin  [25c]  and  Sierpinski  [25]  observed  that  like  the  Borel  hierarchy  the 
projective  hierarchy  is  a proper  hierarchy  (12.8),  and  soon  its  basic  properties  were 
established.  For  example,  each  21  fY P(k(coa>)))  and  II1  C\V (k (co co))  is  closed  under 
countable  unions  and  intersections.  (However,  the  projective  subsets  of  V(k(a>a>)) 
as  a whole  are  not  so  closed;  to  obtain  this  closure,  one  can  continue  the  projective 
hierarchy  into  the  transfinite  incorporating  countable  unions  at  limit  stages.) 

On  the  other  hand,  the  investigation  of  projective  sets  encountered  basic 
obstacles  from  the  beginning.  For  one  thing,  unlike  for  2}  sets  the  cardinality 
problem  for  Ilj  sets  (operatively,  whether  they  have  the  perfect  set  property) 
could  not  be  resolved.  Luzin  [17: 94]  had  already  noted  this  difficulty,  and  it  was 
emphasized  as  a major  problem  in  Luzin  [25a],  In  a confident  and  remarkably 
prophetic  passage  he  declared  that  his  efforts  towards  its  resolution  led  him  to 
a conclusion  “totally  unexpected”,  that  “one  does  not  know  and  one  will  never 
know ” of  the  family  of  projective  sets,  although  it  has  cardinality  2K°  and  consists 
of  “effective  sets”,  whether  every  member  has  cardinality  2K°  if  uncountable,  has 
the  Baire  property,  or  is  even  Lebesgue  measurable.  Luzin  [25b]  pointed  out  the 
specific  problem  of  establishing  the  Lebesgue  measurability  of  2,  sets.  Both  these 
difficulties  at  Ilj  and  2,  were  also  observed  by  Sierpinski  [25:242],  although  he 
was  able  to  show  that  every  2j  set  is  a union  of  K!  Borel  sets  (13.7). 

The  first  wave  of  progress  from  Suslin’s  results  having  worked  itself  out, 
Luzin  provided  systematic  accounts,  especially  in  a classic  text  [30].  He  aired 
the  constructivist  views  of  his  French  predecessors,  and  not  only  did  he  contrive 
somewhat  self-effacingly  to  establish  definite  precedents  for  his  own  work  in 
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theirs,  but  he  also  espoused  their  distrust  of  the  unbridled  Axiom  of  Choice  and 
advocated  their  views  on  definability.  Lebesgue  [05]  had  ended  with  the  question, 
“Can  one  name  a non-measurable  set?”  Taking  this  as  a starting  point  of  his  own 
work,  Luzin  [30:  323]  wrote:  . . the  author  considers  the  question  of  whether 
all  projective  sets  are  measurable  or  not  to  be  unsolvable  [insoluble],  since  in  his 
view  the  methods  of  defining  the  projective  sets  and  Lebesgue  measure  are  not 
comparable,  and  consequently,  not  logically  related.” 

If  the  projective  sets  proved  intractable  with  respect  to  the  regularity  proper- 
ties, significant  progress  was  nonetheless  made  in  other  directions.  In  Luzin  [30a] 
the  investigation  of  uniformization  was  proposed.  For  A,  B c 2(fcS), 

A is  uniformized  by  B iff 

B c A A Vjc(3v((x,  y)  e A)  **  3!y((x,  y)  e B ))  . 

(As  usual,  3 ! abbreviates  the  formalizable  “there  exists  exactly  one”.)  Since  B is  in 
effect  a choice  function  for  an  indexed  family  of  sets,  asserting  the  uniformizability 
of  arbitrary  subsets  of  2 (fed)  is  a version  of  the  Axiom  of  Choice.  Taking  this 
approach  to  the  problem  of  definable  choices,  Luzin  announced  several  results 
about  the  uniformizability  of  £]  sets  by  like  sets.  One  was  affirmed  in  a sharp 
form  by  Petr  Novikov  [31],  who  showed  that  there  is  a closed  set  that  cannot  be 
uniformized  by  any  L]  set.  It  was  eventually  shown  by  Vladimir  Yankov  [41] 
that  any  H]  set  can  be  uniformized  by  a set  that  is  a countable  intersection  of 
countable  unions  of  differences  of  L]  sets.  Interestingly  enough,  von  Neumann 
[49:448ff]  also  established  a less  structured  uniformization  result  for  L|  sets  as 
part  of  an  extensive  study  of  rings  of  operators.  Presumably  because  of  the 
difficulties  encountered  in  the  study  of  Il|  sets  Luzin  [30a:  351]  claimed  that  there 
are  Ilj  sets  that  cannot  be  uniformized  by  any  “distinguishable”  set  and  gave  a 
purported  example.  Notwithstanding,  Sierpinski  [30: 139]  asked  whether  every 
n|  set  can  be  uniformized  by  a Il\,  or  even  any  projective,  set,  and  a result  of 
Novikov  in  Luzin-Novikov  [35]  did  lead  to  the  conclusion  that  every  Ilj  set  can 
be  uniformized  by  a L]  sef  Building  on  this,  the  Japanese  mathematician  Motokiti 
Kondo  established  (see  13.17)  the  Ilj  Uniformization  Theorem : 

12.3  Theorem  (Kondo  [37,39]).  Every  IT]  subset  of  2 (fed)  can  be  uniformized  by 
a n]  set. 

This  was  the  culminating  result  of  the  early  period.  As  Kondo  noted,  his 
result  implies  through  projections  that  every  L]  set  can  be  uniformized  by  a 
set,  but  the  question  of  whether  every  H]  set  can  be  uniformized  by  any  projective 
set  was  left  open. 

The  total  impasse  in  descriptive  set  theory  with  respect  to  the  regularity 
properties  was  to  be  explained  by  Godel’s  work  on  the  constructive  hierarchy.  In 
his  initial  article  [38]  on  L,  he  announced  that  if  V — L,  then  there  is  a A*  set 
of  reals  which  is  not  Lebesgue  measurable  and  a Ilj  set  of  reals  which  does  not 
have  the  perfect  set  property.  (See  13.10  and  13.12  for  proofs.)  Thus,  the  early 
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descriptive  set  theorists  were  confronting  an  obstacle  formalizable  in  ZFC.  The 
importance  that  Godel  attached  to  these  results  can  be  evinced  from  his  listing 
of  each  of  them  on  equal  footing  with  his  AC  and  GCH  results.  However,  he 
did  not  publish  proofs,  and  more  than  a decade  was  to  pass  before  proofs  first 
appeared  in  Novikov  [51].  In  the  meantime,  Godel  [51:67]  (see  Godel  [90:33]) 
had  sketched  a more  basic  result  that  readily  implies  under  V = L that  there  is 
a non-measurable  Aj  set  of  reals:  if  V = L,  then  there  is  a Lj  well-ordering  of 
the  reals.  (According  to  Kreisel  [80:  197],  “. . . according  to  Godel’s  notes,  not 
he,  but  S.  Ulam,  steeped  in  the  Polish  tradition  of  descriptive  set  theory,  noticed 
that  the  definition  of  the  well-ordering  ...  of  subsets  of  to  was  so  simple  that  it 
supplied  a non-measurable  PCA  [i.e.  Lj]  set  of  real  numbers  . . . ”)  Mostowski 
had  also  established  the  result  in  a manuscript  destroyed  during  the  war,  but  it  is 
not  apparent  in  Novikov  [51].  Details  were  eventually  provided  by  Addison  [59] 
who  built  on  this  work  to  settle  the  uniformization  problem  for  projective  sets  and 
related  issues  in  L.  Addison  was  a student  of  Kleene,  who  through  initiatives  from 
mathematical  logic  had  been  developing  what  is  now  called  effective  descriptive 
set  theoiy. 

After  his  fundamental  work  on  recursive  function  theory  in  the  1930’s,  Kleene 
expanded  his  investigations  of  effectiveness  and  developed  a general  theory  of  de- 
finability for  relations  on  to.  In  [43]  he  considered  the  arithmetical  relations , those 
relations  obtainable  from  the  recursive  relations  by  applications  of  number  quan- 
tifiers. Developing  canonical  representations  he  classified  these  relations  into  a 
hierarchy  according  to  quantifier  complexity  and  showed  that  it  is  a proper  hierar- 
chy. In  [55, 55a,  55b]  he  studied  the  analytical  relations , those  relations  obtainable 
from  the  arithmetical  relations  by  applications  of  function  quantifiers.  Again  he 
worked  out  representation  and  hierarchy  results,  and  moreover  he  established  an 
elegant  theorem  that  turned  out  to  be  an  effective  version  of  Suslin’s  characteri- 
zation 12.1  of  Borel  sets. 

Kleene  was  developing  what  amounted  to  the  effective  content  of  the  classical 
theory,  unaware  that  his  work  had  direct  antecedents  in  the  papers  of  Lebesgue, 
Luzin,  and  Sierpinski.  Certainly,  he  had  very  different  motivations:  With  the 
arithmetical  relations  he  wanted  to  extend  the  Incompleteness  Theorem,  and  ana- 
lytical relations  grew  out  of  his  investigations  of  notations  for  recursive  ordinals. 
Moreover,  his  work  featured  refinements  made  possible  by  his  Recursion  Theorem 
and  its  sometimes  miraculous  applications  in  the  method  of  effective  transfinite 
recursion.  On  the  other  hand,  already  in  [43:50]  he  did  make  elliptic  remarks 
about  possible  analogies  with  the  classical  theory.  Pursuing  one  such  analogy, 
Mostowski  [47]  had  also  derived  the  results  on  arithmetical  relations.  But  further 
investigations  exposed  shortcomings  with  this  analogy,  and  it  was  Addison  who 
established  the  proper  correlation:  it  is  the  analytical  relations  that  are  analogous 
to  the  projective  sets,  with  the  arithmetical  relations  being  analogous  to  the  sets 
in  Uo<n<®^«’  the  finite  levels  of  the  Borel  hierarchy.  As  he  would  put  it, 
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analytical  arithmetical 
projective  finite  level  Borel 

The  conceptual  move  in  Kleene’s  work  from  relations  on  co  to  relations  on  "'a> 
revealed  that  this  correlation  was  exact:  as  pointed  out  by  Addison  [58: 126]  the 
clopen  subsets  of  <°co  are  exactly  those  sets  recursive  in  some  real  (12.6(a)).  Had 
Kleene  seen  his  work  in  this  light  from  the  beginning,  it  is  doubtful  that  he  would 
have  initially  used  “analytic”  as  he  did  for  “analytical”;  the  latter  term  may  have 
evolved  to  draw  a distinction  from  the  classical  analytic  sets,  but  is  still  a source 
of  confusion. 

The  development  of  effective  descriptive  set  theory  considerably  clarified 
the  classical  context,  injected  recursion-theoretic  techniques  into  the  subject,  and 
placed  definability  considerations  squarely  at  its  forefront.  Not  only  were  new 
approaches  to  classical  problems  provided,  but  results  and  questions  could  now 
be  formulated  in  a refined  setting.  That  there  is  a basic  connection  between  the 
projective  sets  and  logical  definability  had  been  observed  in  Kuratowski-Tarski 
[31]  and  Kuratowski  [31];  it  is  the  careful  attention  to  real  parameters  that  distin- 
guishes the  modem  approach.  The  effective  context  is  forthwith  established  via 
definability  in  what  follows. 

The  development  of  descriptive  set  theory  can  be  carried  out  in  ZF  + DC, 
DC  being  the  Principle  of  Dependent  Choices  from  11.1.  The  principle  becomes 
crucial  for  the  later  use  of  strong  hypotheses  (Chapter  6);  it  is  then  needed  for  the 
characterization  of  well-foundedness  in  terms  of  the  lack  of  infinite  descending 
chains.  Recalling  that  DC  implies  the  countable  choice  principle  AC„„  consider 
its  further  restriction  to  sets  of  reals: 

(AQa (“&>))  Every  countable  set  consisting  of 

non-empty  sets  of  reals  has  a choice  function  . 

This  principle,  applied  to  choose  countably  many  reals  coding  well-orderings, 
suffices  to  show  that  co\  is  regular.  An  important  observation  for  later  is  that: 

The  main  development  through  this  and 

the  next  section  can  be  carried  out  in  ZF  + AC^co)  . 

Those  few  results  and  remarks  involving  more  substantial  uses  of  AC  will 
serve  mainly  to  frame  the  discussion  and  will  be  noted  explicitly. 

The  Definability  Context 

Second-order  arithmetic  is  taken  to  be  the  two-sorted  structure 
A2  = {co,  wco,  ap,  +,  x,  exp,  <,  0,  1)  , 

where  co  and  "&>  are  two  separate  domains  connected  by  the  binary  operation 
ap\  mco  x w -ms  of  application. 


ap(x,  m)  = x(m)  , 
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and  +,  x,exp , <,0,  1 impose  the  usual  arithmetical  structure  on  co,  with  exp 
being  exponentiation,  (exp  is  traditionally  not  taken  as  a basic  operation,  but 
its  inclusion  simplifies  our  presentation  without  altering  the  significant  concepts 
of  descriptive  set  theory.)  In  addition  to  symbols  for  these  features  the  underly- 
ing language  has  two  sorts  of  variables,  those  ranging  over  co  to  be  denoted  by 
Vq,  Vj,  Vj,  . . . and  those  ranging  over  wco  to  be  denoted  by  Vq,v\,v\ Cor- 

responding are  the  number  quantifiers  and  function  quantifiers,  to  be  denoted 
by  3°,V°  and  31,V1  respectively.  Terms  for  numbers  are  defined  recursively 
by  closing  off  0,  1,  v°,  u®,  v^, ...  under  +,  x,  exp,  and  ap(v) , •)•  For  any  such 
term  r,  (3°u(°  < r )q>  is  written  for  3°u?(u®  < r A cp)  and  (V0u(°  < r )cp  for 
V°i>,°(u,0  < r ->  <p)  whenever  possible;  these  are  the  bounded  quantifiers.  A2  is  a 
full  second-order  structure  with  ® co  playing  the  role  of  V(co). 

Generally  speaking,  questions  of  definability  in  this  structure  can  pertain  to 
any  relations  A c rco  x k(loco)  for  some  r,  k e co.  The  case  A c k(mco)  was  the 
preoccupation  of  classical  descriptive  set  theory,  but  the  general  case  is  intrinsic 
to  the  effective  theory.  Regarding  the  members  of  rco  x k(f°oo)  as  (r  + k 1-tuples, 
to  be  written  interchangeably  are 

A(m\, . . . , mr,  x\, . . . , Xk)  iff  (m\, . . . , mr,  x\, . . . , xf)  e A , 

with  A regarded  as  a relation  on  the  one  hand  and  a set  on  the  other.  Notational 
variations  like 

A(m,  w) 

when  r > 0,  m e co,  and  w e r~lcoxk(a>co)  have  the  expected  meaning.  Such  an  A 

is  definable  in  A2  by  a formula  (piv®, , . . ,v®,v\, . . . ,v\)  iff  for  any  m\ mr  e 

a>  and  x\, ...  ,xu  e " co, 

A(m\,  . . . , mr,  xi,  ... , xfi  iff  A2  1=  <p[m  i,  . . . , mr , X\,  . . . , Xk\  . 

Henceforth,  the  syntax  and  semantics  of  A2  are  informally  conflated  in  such  trav- 
esties as  V°m31xA(m,  x)  and  -■A(w),  focusing  primarily  on  relations  rather  than 
their  defining  formulas. 

Classifying  relations  according  to  quantifier  complexity,  stipulate  for  r,  k e co 
and  A c rco  x k(‘°co)  that: 

A e Aq  iff  A is  definable  in  A2  by  a formula 

whose  only  quantifiers  are  bounded  . 

Aq  is  also  used  as  an  adjective. 

A®  codings  of  sequences  will  often  be  used.  For  definiteness,  let  p,  denote 
the  (i  + l)st  prime,  and  for  mo, . . . ,mr,m,i  e co  set 

(mo mr)  = p™0+l  ■ p'"'+1  ■ ■ ■ p'fi+x  , and 

( m)i  = min({e  | pA2  does  not  divide  m})  . 
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These  functions  are  AjJ  (and  it  is  for  this  that  exp  was  included  in  A2).  Moreover, 
(m),-  is  always  defined,  and  ({mo, . . . , /«,  )),  = m,  for  i < r.  In  §0  an  enumeration 
(s,-  | i e to)  of  <01to  had  been  fixed;  henceforth,  the  decoding  of  s,-  from  i will  be 
taken  to  be  as  A®,  using 

)(m o,  . . . , mr)  if  i = {mo, . . . , mr)  , and 
0 otherwise  . 

For  x e 10 to  and  m e to  set 

x(m)  — (x (0) , . . . , x(m  — 1))  e to 
and  define  (*),■  e wco  for  i e to  by 

(x)i(m)  = x((i,  m))  . 

The  x notation  is  extended  to  w e rco  x k(wto)  by  stipulating  that  for  m e 
to,  w (m)  e r+kto  is  like  w except  that  each  occurrence  of  any  x e wto  has  been 
replaced  by  x(m). 

Continuing, 

A is  arithmetical  iff  A is  definable  in  A2  by  a formula 
without  function  quantifiers  . 

Bounded  quantifiers  can  be  shifted  to  the  right  and  like  quantifiers  contracted  using 
the  following  equivalences  and  their  duals  (obtained  by  negating  both  sides): 

(V®/?  < q)3°mR(p,  q,  m,  w)  -o-  3°w(V0/;  < q)R(p,  q,  (m)p,  w) 

3°m3° pR(m,  p,  w)  **  3°m/?((m)o,  (m)i,  w)  . 

Flence,  the  arithmetical  relations  can  be  classified  into  a hierarchy,  the  arithmetical 
hierarchy,  by  defining  the  (lightface!)  and  77°  classes  for  n e to  as  follows: 

A e X1®  iff  Vw(A(w)  3°7«iV°W2  . . . QmnR(m\, . .. , mn,  w))  , and 

A e 77®  iff  Vw(A(w)  •<->•  V°mi3°7n2  • • • QmnR(m\, . . . , mn,  w)) 

for  some  Aq  R c T+nco  x k(c°to),  where  Q is  3°  if  n is  odd  and  V°  if  n is 

even  in  the  first  and  vice  versa  in  the  second.  (It  is  notationally  incumbent  that 
Eq  — 77q  = z\®,  but  this  will  be  irrelevant.)  X1®  and  77®  are  dual  collections, 
i.e.  A e X®  iff  ' to  x k(mco)  — A e 77®;  X,®  U 77®  c X®+1  n 77®+1;  and  these  terms 
are  also  used  as  adjectives. 

A is  recursive  iff  A e X®  IT  77®  . 

This  conforms  for  A C ' to  to  the  original  1930’s  concept  of  recursive  relation 
(although  this  by  no  means  is  immediate  from  our  formulation).  Kleene  extended 
the  theory  in  the  1950’s  to  include  A C ra>  x k(f°co).  There  are  recursive  relations 
that  are  not  Ajj,  i.e.  X®  fl  77®  properly  extends  A®. 
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Finally, 

A is  analytical  iff  A is  definable  in  A2  . 

Number  quantifiers  can  be  shifted  to  the  right  and  like  function  quantifiers  con- 
tracted using  the  following  equivalences  and  their  duals: 

V°m3lx7?(m,  w,  x)  **  f xd°mR(m,  w,  (x)m) 

(*)  w,  x)  **  31x77((x)o(0),  w,  (x)i) 

x,  y)  **  3'x7?(w,  (x)o,  (x)i)  . 

(The  first  makes  use  of  AQ„  (“&>);  (*)  will  be  seen  to  be  necessary  for  the  classifi- 
cation of  relations,  leading  to  the  coming  12.7  and  other  basic  results.)  It  follows 
that  the  analytical  relations  can  be  classified  into  a hierarchy,  the  analytical  hier- 
archy, by  defining  the  (lightface!)  17,}  and  77,)  classes  for  n e co  as  follows:  First, 
for  later  inductive  schemes  set 

Sx0  = 17?  and  77()  = 77®  . 


Then  for  n > 0,  define 

A e Si  iff  Vw(A(w)  **  31xiV1X2  . . . QxnR(w,  xi, . . . , x„))  , and 
A e 77, J iff  Vw(A(w) V1xi31X2  . . . 2x„7?(w,  xi, . . . , x„)) 

for  some  arithmetical  R C.  ra>  x k+n{a,u>),  where  Q is  31  if  n is  odd  and  V1  if  n is 
even  in  the  first  and  vice  versa  in  the  second.  Also  define 

< = n'n . 

As  before,  S\  and  77, [ are  dual  collections,  17, [ U 77, [ C A*+1,  and  these  terms 
are  also  used  as  adjectives.  The  terminology  is  consistent  with  the  classification 
of  formulas  in  §0,  but  the  focus  on  relations  and  the  use  of  0>w  rather  than  V{co) 
makes  the  present  context  distinctive. 

The  quantifier  equivalences  (*)  above  show  that  the  classification  of  any 
relation  in  the  analytical  hierarchy  only  depends  on  the  function  quantifiers  and 
their  alternations',  this  is  frequently  relied  upon  without  further  mention.  Although 
the  foregoing  may  be  the  natural  formulation  of  the  hierarchy,  there  is  an  important 
self-refinement  that  leads  to  simple  normal  forms  (and  vindicates  our  definitions 
of  S(\  and  77,}): 

12.4  Exercise.  If  n > 0 is  even,  there  is  a A}}  R C '+<:+»+i&,  sucfj  that  the 
arithmetical  R(w,  x\,  , xn ) above  can  be  replaced  by 

3 °mR(m,  w(m),  x\ (m), . . . , x„(m))  . 

An  analogous  assertion  holds  for  n odd,  with  V°w  replacing  3 °m. 
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Hint.  Take  n > 0 even,  so  that  the  rightmost  function  quantifier  Q is  V1;  the 
other  case  follows  by  taking  negations.  First  change  each  number  quantifier  V°p 
in  a prenex  arithmetical  definition  of  R to  a function  quantifier  V'.v  by  replacing 
occurrences  of  p by  x(0).  Using  the  equivalences  (*),  shift  these  new  quantifiers 
to  the  left  and  contract  like  quantifiers.  Hence,  all  the  V°’s  of  R have  been 
absorbed  into  Q,  and  what  remains  is  of  form  3° pR0(p,w,  x\, . . . , xn)  where 
R0  is  /2®.  Being  definable  in  A2  with  only  bounded  quantifiers,  for  each  p, 
Ro(p,  w,  xi, . . . , xn)  only  depends  on  w(q),xi(q), . . . ,x„(q)  for  some  q e co 
sufficiently  large.  Thus, 

3°pR0(p.  w,  x\ ,xn)  3° p3°qRi(p,w(q),Xi(q), . . . ,xn(q)) 

for  some  /?i , and  the  quantifiers  can  be  contracted  to  complete  the  argument. 

H 


Having  defined  the  effective  (“lightface”)  hierarchies  the  correlation  with 
the  Borel  and  projective  (“boldface”)  hierarchies  can  be  made  through  the  simple 
expedient  of  relativization  to  real  parameters.  For  a e “co,  second-order  arithmetic 
in  a is  the  expanded  structure 

A2 (a)  — {co,  “co,  ap,  +,  x,  exp , <,  0,  1,  a) 

where  a is  regarded  as  a binary  relation  on  co.  Replacing  A2  by  A2  (a)  in  the 
preceding  there  are  the  corresponding  relativized  notions: 

in  a 

recursive  in  a 
arithmetical  in  a 
27»,  V7» 
analytical  in  a 
£>)  , K (a)  , A'ja)  . 

The  following  is  a simple  observation  that  will  be  used  without  further  men- 
tion: 

12.5  Exercise.  For  a e “co,  R c rw  x k(a>oj)  is  27®  (a)  iff  there  is  a 1 7®  S c 

r u>  x t+1(<ucu)  such  that  Vw(R(w)  -o-  ,S’(w,  a)).  Hence,  analogous  statements  hold 
for  Hi  Hi  Si  and  Hi  H 

This  provides  a way  of  defining  the  relativized  hierarchies  in  ^42,  but  one 
has  to  be  careful:  although  Ajfa)  — n„(a)  D n„(a),  12.5  does  not  hold  with  the 
27® ’s  replaced  by  Axn. 

Here  is  the  correlation  with  the  classical  hierarchies: 

12.6  Proposition.  Suppose  that  A C k(">oj)  and  n > 0.  Then: 

(a)  A e L®  iff  A e 27®(a)  for  some  a e 10  co,  and  similarly  for  II®. 

(b)  A e 27,*  iff  A e 27, } (a)  for  some  a e m co , and  similarly  for  II,*. 
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Proof,  (a)  For  n — 1,  suppose  first  that  A e 2®.  Using  our  A®  decoding  of 
S/  e <wto  from  i and  laying  out  the  codes  for  the  basic  open  sets  that  make  up  A 
in  blocks  of  length  k,  there  is  an  a e “&>  such  that  for  any  w e k(0>co), 

A(w)  3°m(w  e 0{sa(mk))  x 0(sa(m*+p)  x . . . x O (sa(mk+k-i)))  ■ 

Hence,  it  is  readily  seen  that  A e 2®  (a).  Conversely,  if  A e X7® (or)  for  some 
a e “ co,  then  for  any  w e k(fco), 

A(w)  3 °mR(tn,  w) 

where  R is  recursive  in  a.  For  each  m,  R(m,  w)  only  depends  on  w (q)  for  some 
q.  Consequently  each  Rm  = {w  | R(m,  w)}  is  open,  and  so  is  A = |Jm  R,„. 

The  II®  case  is  an  immediate  consequence  of  the  2®  case,  and  the  2®+1  case 
follows  from  the  II®  case  by  an  argument  similar  to  the  base  case. 

(b)  This  is  a direct  consequence  of  the  definitions,  12.4,  and  the  n = 1 case 
of  (a).  H 

Thus,  via  2®(a)  fl  /7®(a)  being  the  collection  of  relations  recursive  in  a we 
have  Addison’s  dictum  mentioned  earlier  that  the  clopen  subsets  of  a to  are  exactly 
those  sets  recursive  in  some  real.  The  uniform  notation  of  2,  2,  and  so  forth  that 
has  been  used  all  along  was  advocated  by  Addison  [58]  to  reflect  the  correlations. 
Results  will  often  be  stated  in  terms  of  the  2(J(a)’ s for  the  effective  content;  these 
renderings  imply  of  course  the  corresponding  results  about  the  classical  2^’s. 

There  is  also  an  effective  analogue  of  the  transfinite  Borel  levels  2®,  II® 
for  a > to  (see  Hinman  [78;  IV§4]).  For  relations  c rto,  the  original  formula- 
tion through  integer  notations  for  ordinals  is  Kleene’s  hyperarithmetic  hierarchy, 
indeed,  its  study  was  the  major  motivation  of  his  formulation  of  the  analytical  hi- 
erarchy. In  the  classical  context,  extending  the  hierarchy  Uo<n««  °f  the  finite 
level  Borel  sets  into  the  transfinite  results  in  the  Borel  sets,  which  by  Suslin’s  fun- 
damental result  12.1  coincides  with  the  A[  sets.  In  the  effective  context,  extending 
the  hierarchy  2®  of  arithmetical  sets  into  the  transfinite  through  analogous 
but  recursive  procedures  results  in  the  hyperarithmetic  sets,  which  by  Kleene’s 
analogue  for  12.1  coincides  with  the  A\  sets.  (For  more  on  hyperarithmetic  the- 
ory, see  Sacks  [90:  A§II].) 

Having  established  a working  context  for  descriptive  set  theory,  we  now 
pursue  the  development  of  the  subject  through  its  classical  results.  The  emphasis 
will  be  on  the  traditional  relations  c k(coa>),  although  relations  c ' oo  x k(fto)  can 
be  easily  accommodated  with  only  notational  complications. 

The  first  important  idea  of  descriptive  set  theory  was  the  use  of  Cantor’s 
universal  enumeration  and  diagonalization  argument  to  effect  a hierarchical  tran- 
scendence. Borel  [05:  note  III]  had  mentioned  the  idea,  and  Lebesgue  [05]  incor- 
porated definability  considerations  to  achieve  his  hierarchy  results.  An  application 
is  given  here  in  set  form  to  show  that  the  projective  hierarchy  is  proper.  For  any 
collection  r of  sets, 
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U is  universal  for  r iff  (i)  U c 2(®&>)  and  U e T , and 
(ii)  for  any  A c. 01  co,  A e r iff 

A = {x  | (a,  x)  e U)  for  some  a e “&> . 

For  0 < k e co  any  homeomorphic  identification  of  k(wco)  with  woo  yields  subsets 
of  k+ 1 (wa>)  in  r analogously  universal  for  the  subsets  of  k(a>co)  in  r,  a concept 
that  occurs  in  the  following  proposition. 

12.7  Proposition  (Luzin  [25c];  Sierpinski  [25]).  For  each  n > 0,  there  is  a 27„' 
set  universal  for  27,',  (and  so  a 77,)  set  universal  for  II, *^). 

Proof.  First  note  that  for  0 < k e co  there  is  a 27®  set  Wk  g k+1(wa>)  universal  for 
27®  subsets  of  kCa>),  i.e.  for  any  A g k(0>co), 

A e 27®  iff  A — [w  | Wk(a , w)}  for  some  a e ® co  . 

Simply  set 

Wk(a,  w)  iff  3°m(w  e 0(sa(mk))  x 0(sa(m*+i))  x ...  x 0(sa(m*+i_i))) 

as  in  the  proof  of  12.6(a).  It  is  now  straightforward  to  check,  recalling  12.4,  that 
if  U„  is  defined  by 

U„ (a,  x)  iff  3*xiV1X2  . . . V'x„  Wn+\ (a,  x,  x\, . . . , x„) 

when  n is  even,  and  with  V'x„  replaced  by  3 lxn  and  Wn+ \ by  its  complement 
when  n is  odd,  then  U„  is  universal  for  L*.  The  parenthetical  remark  about  II,' 
follows  from  taking  the  complement  of  Un . H 

12.8  Corollary.  For  each  n > 0,  27, } g A,1,,  77, J g A,1,,  and  Aln+l  g 27, ' U IT*. 

Proof  This  uses  Cantor’s  diagonalization  argument.  With  U„  as  in  the  previous 
proof,  set  Dn(x ) Un(x,  x)  for  any  x e w co.  Then  Dn  e 27, J,  and  D„  f II,': 
Otherwise,  —<D„  =‘0co—  D„  e 27,'  and  there  would  be  an  a e w a>  such  that  for  any 
x e ma),  —•D, ,(x)  U„(a,  x).  But  then,  there  is  the  paradigmatic  contradiction 

—•Dn(a)  **  Un(a,a ) **  Dn(a)  . 

That  77,)  g A,1,  follows  by  considering  and  that  A,'|+1  g 27,',  U II,1,  follows  by 
considering  the  recursive  union 

En  = {(x)°  | x e D„}  U[(x)'  | xfD„] 

where  for  i < 2,  (x)'  e wco  is  given  by 

)i  if  m — 0 , and 

x(m  — 1)  if  m > 0 . 

En  e Aln+1  since  D„  e 27, [ and  -<D„  e 77, J,  yet  En  f 27*  U II*  since  both  D„  and 
—•D„  can  be  recovered  from  it.  H 
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Note  that  the  diagonalization  argument  shows  that  there  can  be  no  A*  set 
universal  for  A*.  An  analogous  argument  with  number  quantifiers  shows  that  the 
finite  levels  of  the  Borel  hierarchy  are  proper.  Lebesgue  [05]  similarly  showed 
that  all  levels  of  the  hierarchy  are  proper  (and  bearing  on  his  skepticism  of  the 
Axiom  of  Choice,  AC  0fmaj)  is  nonetheless  seen  to  be  necessary  here  to  choose 
cofinal  sequences  for  limit  ordinals). 

The  next  important  idea  of  descriptive  set  theory  was  to  analyze  sets  by  asso- 
ciating ordinals  to  their  members.  With  roots  in  the  Lebesgue  paper  and  first  appli- 
cations in  Luzin-Sierpinski  [18],  the  approach  became  basic  in  Luzin-Sierpinski 
[23],  a fundamental  paper  on  analytic  sets.  Establishing  some  terminology  in 
sufficient  generality  for  later  use,  stipulate  for  any  k e co  and  set  Y that 

T is  a tree  on  kco  x Y iff  (i)  T c Ume<u  kim(n)  x mY  , and 
(ii)  if  {t0,  ...,tk)e  T,  then 

(to\m,  . . . , tk\m)  e T for  any  m < |?ol  ■ 

(When  k — 0,  T is  a tree  on  Y iff  T c <mY  and  is  closed  under  the  taking  of 
initial  segments;  analogous  conventions  apply  when  Y — 0.)  Thus,  T consists  of 
(k  + l)-tuples  of  finite  sequences  of  the  same  length  and  is  a tree  in  the  sense  of 
§7  under  a natural  initial  segment  ordering.  [T]  denotes  the  collection  of  infinite 
branches  of  T : 

(x0, ...,  xk—  i , f)  e [T]  iff  (xq,  . . . , xk—\ , />  e kCco)  x mY  and 

V°m((xo|m, . . . , xk~i\m,  f\m)  e T)  . 

Also,  for  w = (.vo,  ■ • • , x/_i)  e /(®cu)  with  / < k,  T „ is  that  tree  on  k~la)  x Y 
defined  by 

{t0, ...,  tk-t)  e T„  iff  3°m((x0|m, . . . , x/_i|  m,  t0, ...,  tk-i ) e T)  . 

Tx  is  written  for  T(x)- 

Next  define  C*  to  be  coordinate-wise  inclusion  for  n-tuples,  i.e. 

(xi,  . . . , x„)  C*  (yi,  . . . , yn)  iff  x,-  C y,  for  1 <('<«, 

and  let  c*,  3*,  and  D*  have  the  derived  meanings.  Also  define  to  be 
coordinate-wise  concatenation  for  n -tuples  of  sequences,  i.e. 

(xi, . . . , xn)'~'*  {y\, . . . , y„)  = (xf  y\, ...,  xfyn)  . 

When  k — 0 or  Y — 0 in  the  definition  of  T,  D*  is  regarded  as  reverting  back  to 
D and  so  forth. 

In  these  terms,  some  further  terminology  for  a T as  above  is  developed:  First, 
for  any  set  t, 

T/t  = {s  | f~*s  e T)  . 

Next, 

T is  well-founded  iff  [T]  = 0 , 
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i.e.  reverse  coordinate-wise  inclusion  D*  is  a well-founded  relation  on  T . (In  this 
context,  salient  is  the  lack  of  infinite  descending  chains.  The  equivalence  of  this 
with  the  usual  formulation  of  well-foundedness  requires  DC  in  general,  but  the 
focus  will  be  on  well-orderable  T’s  and  hence  well-orderable  7”s,  for  which  DC 
is  not  needed.)  For  such  T,  the  rank  function  pr-  T ->  On  is  defined  in  the  usual 
way  (cf.  0.3)  by: 

Pr(t)  = sup({p7  (.v)  + 1 | 0*1)), 

noting  that  pr(t)  = sup(0)  — 0 iff  t is  maximal  in  D*.  The  height  of  T is  then 
defined  by 

\\T\\-  ran (pT)  . 

Note  that  ||r||  is  an  ordinal,  indeed  ||T||  = pr((0,  . . . , 0)),  and  generally  pj(t)  = 
\\T/t ||.  pr  is  a basic  example  of  an  order-preserving  map  p:  (T,  D*>  ->  (On,  <), 
i.e.  if  s,  t e T with  s D*  t,  then  p(s)  < pit).  There  is  also  the  concept  of  an 
order-presen’ing  map  p:  T\  ->  Ti  between  two  trees  7)  and  Ti,  i.e.  if  s,  t e 7) 
with  s D*  t,  then  pis)  D*  pit).  The  following  are  simple  but  crucial  observations; 
no  appeal  will  be  made  to  (a)  unless  Y is  well-orderable. 

12.9  Exercise. 

(a)  For  a tree  T on  kco  x Y,  T is  well-founded  iff  there  is  an  order-preserving 
map  p:  T — > On.  If  such  a p exists,  then  prit)  < pit)  for  every > t e T,  and 
Pt : T ->  maxdcu!,  |F|+}). 

(b)  For  trees  S and  T,  there  is  an  order-preserving  map:  S — >•  T iff  either  T 
is  ill-founded  or  else  ||5||  < ||T||. 

Hint.  For  (a),  recall  0.3. 

For  (b),  if  /:  S -*  T is  order-preserving  and  T is  well-founded,  then 
Pr°f : S ->  On  is  order-preserving,  so  by  (a), 

lisil  =ps(0)<M/(0))<  liry . 

For  the  converse,  if  T is  ill-founded,  then  fixing  an  infinite  branch  through 
T,  S can  be  mapped  in  an  order-preserving  manner  into  T by  assigning  all  the 
elements  in  a level  to  the  single  node  on  the  branch  at  the  corresponding  level.  If 
T is  well-founded  and  ||S||  < ||r||,  then  proceed  by  induction  on  ||T||:  For  each 
(i)  e S,  ||5'/{i)||  < ||r||,  and  so  there  is  a tj  e T such  that  t ,•  f 0 and  ||S/(;)||  < 
||  T jti  || . Let  f:  S/(i)  -»  T /t,  be  order-preserving  and  define  /:  S ->  T by 

[0  if  s = 0 , and 

m 

[trfi(s)  if  s = (i)~s . 


Trees  provide  a simple  analysis  of  closed  sets: 

12.10  Proposition.  For  C c k (":o>) , C is  closed  iff  there  is  a tree  T on  ka:>  such 
that  C = [T]. 
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Proof.  Take  k = I for  notational  simplicity.  Given  C,  set 
T = {x\m  | x e C A mew] 

so  that  C c [7],  If  C is  closed,  then  for  any  y f C there  is  an  m e co  such  that 
0{y\m)  n C = 0,  so  y f [T\.  Hence,  C = [7  ], 

Conversely,  if  T is  a tree  on  co  and  x f [T],  then  for  some  in  e co,  x\m  f T 
and  consequently  0(x\m ) fi  [T]  = 0.  Thus,  “ co  — [T]  is  open.  H 

The  next  section  begins  with  a tree  representation  of  n}  sets  and  proceeds 
to  the  consequent  classical  results. 
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13.  Il}  Sets  and  Sets 

This  section  is  devoted  to  the  classical  analysis  of  n[  sets  and  £,  sets,  building  on 
the  groundwork  laid  in  the  previous  section.  As  described  there,  the  development 
can  be  carried  out  in  ZF  + Ac„r«). 

Well-founded  trees  as  articulated  at  the  end  of  the  previous  section  provide 
a basic  representation  of  n[  sets,  the  classical  result  later  given  effective  content: 

13.1  Theorem  (Luzin-Sierpinski  [23],  Kleene  [55]).  Suppose  that  a e <0a>  and 
A C k (far).  Then  A is  n\(a)  iff  there  is  a tree  T on  kco  x a>  such  that  for  any 
w e kicoco), 

A(w)  **  7W  is  well-founded  , 
and  the  relation  { (i,  w)  | S;  e T„}  is  recursive  in  a. 

Proof.  This  can  be  seen  as  a corollary  to  12.10,  but  is  established  anew.  Given 
such  a representation,  A(w)  is  clearly  77/ (a): 

A( w)  V1y3°/3°p(s(-  = y\p  A s*  Tw)  . 

For  the  converse,  take  k = 1 for  notational  simplicity.  By  12.4,  there  is  an 
R c 3oi  recursive  in  a such  that  for  any  x e :"o>, 

A(x)  <*V1y:30mR(m,x(m),y(m))  . 

Let  T be  that  tree  on  a>  x a>  consisting  of  the  “unsecured”  sequences: 

T = {(s,  t)  | V0/7  < \s\  + H-'RCp,  (s(0), . . . , s(p  - 1)>,  (r(0), . . . , tip  - 1))))}  . 

Then  A(x)  iff  Tx  is  well-founded,  and  {(/,  x)  \ s,-  e Tx}  is  a relation  recursive 
in  a.  H 


By  this  means,  for  any  II  [ set  A with  corresponding  tree  T an  ordinal  ||rw|| 
can  be  associated  to  each  w e A.  Actually,  Luzin-Sierpinski  [23]  focused  on 
well-orderings,  and  Kleene  [55]  on  integer  notations  for  ordinals.  Both  in  effect 
linearized  trees  using  an  ordering  that  first  appeared  in  the  earlier  paper,  although 
it  is  now  known  as  the  Kleene-Brouwer  ordering : For  s,  t e <foOn, 

s <kb  i ff  either  s D ( or  else  for  the  least  i 
such  that  sii ) f t(i),  sii)  < tii ) . 

<kb  is  a strict  linear  ordering  of  <fflOn.  Note  that  it  is  s D t in  the  definition 
rather  than  s C t.  As  a simple  way  to  remember  the  ordering,  .y  <kb  t iff 
s^(oo,  oo,  oo, . . .)  is  lexicographically  less  than  t''(oo,  oo,  oo,  . . .)  where  oo  is 
formally  greater  than  all  the  ordinals. 

13.2  Exercise  (Luzin-Sierpinski  [23]).  If  T is  a tree  on  y,  then  T is  well-founded 
iff  T is  well-ordered  by  <kb-  Hence,  well-founded’’ can  be  replaced  in  13.1  by 
well-ordered  by  <kb  H 
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Hint.  Suppose  that  (t,  | i e to)  is  an  infinite  <kb  descending  chain  in  T . The 
tj’s  for  i > 0 cannot  be  the  empty  sequence,  and  since  {f,  ( 0)  | 0 < i < co)  is  a 
non-increasing  sequence  of  ordinals,  there  is  a A'o  e co  such  that  f,( 0)  has  a fixed 
value  y(0)  for  i > k o.  The  tj’s  for  i > ko  cannot  have  length  1,  and  so  there  is  a 
k\  e w with  ko  < k\  such  that  f,  (l)  has  a fixed  value  y(l)  for  i > k\.  Continuing, 
the  resulting  y e “’co  verifies  that  T is  ill-founded.  H 

The  development  is  carried  out  in  the  more  immediate  terms  of  well-founded 
relations.  Not  only  have  they  become  as  basic  in  set  theory  as  well-orderings, 
but  later  generalizations  were  to  establish  direct  connections  with  well-founded 
models. 

The  representation  13.1  provides  a basic  stratification: 

13.3  Proposition  (Luzin-Sierpinski  [18,23]).  Every  Ilj  (and  hence  Ilj)  set  is  both 
a union  and  an  intersection  of  Kj  Borel  sets. 

Proof.  Suppose  that  A C k(a>co)  is  Ilj,  and  T is  a tree  on  kcox  co  as  given  by  13.1 
so  that  for  any  w e 

A( w)  **  Tw  is  well-founded  . 

For  s e <mco  and  a < co\,  set 

Bq  = { we  kCco)  | Tw/s  is  well-founded  with  ||Tw/.y||  < a}  . 

Then  it  follows  by  induction  on  a that  these  sets  are  Borel: 

Bo  = ni6<8{w  e \aco)  | s~(i)  i Tw]  , 

and  since  .s'"  (;)  ^ Tw  depends  only  on  the  initial  segment  w(|^|  + 1),  B q is  Borel, 
in  fact  II®.  For  a > 0,  note  that  a tree  U on  co  is  well-founded  with  ||[/||  < a 
exactly  when  for  any  i e co  there  is  a /l  < a such  that  U /(i)  is  well-founded  with 
\\U/m<P-  Hence, 

K = r\,M<aBp  {l) 

is  Borel  by  induction.  A is  consequently  a union  of  Ki  Borel  sets: 

a = U<„  B»  . 

To  show  that  A is  also  an  intersection  of  Kj  Borel  sets,  for  a < co\  set 

Ca  = {w  e kCco)  | (Tw  is  well-founded  with  ||  Tw||  < a)  v 

e <<0w{T^/s  is  well-founded  with  ||rw/i||  = a)}  . 

That  the  Bfs  are  Borel  implies  in  a straightforward  manner  that  the  Cf  s are 
Borel.  We  shall  verify  that 

^ = na<(«,  Ca  ■ 
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If  w e A,  then  Tw  is  well-founded,  so  that  for  any  a < w\  either  ||rw||  < a 
or  else  there  is  an  s e <u>w  such  that  HTw/sH  = a,  i.e.  w e Ca.  On  the  other 
hand,  if  w ^ A,  then  rw  is  ill-founded,  and  also  for  any  s e <alw  there  is  at  most 
one  P < w\  such  that  Tw/s  is  well-founded  with  ||  Tw/s ||  = ft.  But  since  <tow  is 
countable,  there  must  be  some  a < w\  such  that  w <£  Ca.  ~\ 

The  Borel  sets  B%  figure  in  a general  bounding  principle  for  analytic  sets: 

13.4  Proposition  ( Luzin- Sierpinski  [18,23]).  Suppose  that  A C k(wco)  is  Ilj  with 
a corresponding  tree  T on  ka>  x w as  given  by  13.1,  and  B C A is  £[.  Then  there 
is  an  a < a>\  such  that 

B c {w  e k(fw)  \ Ty,  is  well-founded  with  ||  Tw||  < a]  . 

Proof.  Taking  k — 1 for  simplicity,  by  12.4  let  U be  a tree  on  co  x a>  so  that 
B — p[U ],  i.e.  for  any  w e 

w e B **  31y((ui,  y)  e [I/])  . 

For  new  and  s,t,u  e nw,  set 

Bs.t.u  = {w  e B \ (s,  t)  e T a s c w a 31y((u;,  y)  e [17]  A u c y)}  . 

Assume  now  to  the  contrary  that  sup({||7’„)||  \ w e B})  — w\.  Define 
sn,tn,un  e "or  for  every  new  such  that  s„  c sn+ 1 , tn  c tn+  \ , u„  c un+l, 
and 

(*)  sup({||ru)/t„||  | w e = w\ 

by  recursion  on  n as  follows:  Set  s0  = t0  — uq  — 0,  so  that  BSlhUhU0  — B and 
(*)  holds  for  n — 0 by  assumption.  Given  sn,tn,un  satisfying  (*),  since  there 
are  only  countably  many  one-step  extensions  of  s„,  t„,  u„,  and  since  ||  Tw/t„ ||  = 
sup([||7’u,/f)['(/}||  + 1 | i e ft;}),  recursively  define  s„+i  ^ sn,  un+\  3 and 
tn+ 1 2 t„  so  that  (*)  is  satisfied  with  n replaced  by  n + 1. 

Set  w = U„  sn , * = Un  bi , and  y = |Jn  un . Then  it  is  easy  to  see  that 
(w,  y)  e [17],  i.e.  w e B,  yet  Tw  is  ill-founded  since  x e [7’,„],  Contradiction!  H 

The  foregoing  proof  was  drawn  from  the  original  publications.  Curiously, 
recent  texts  give  the  following  less  direct  proof: 

Second  proof  of  13.4.  Assume  to  the  contrary  that  for  any  a < w \ there  is  a w e B 
with  ||  T^w  ||  > a.  It  will  be  shown  that  any  Ilj  set  of  reals  is  also  £},  contradicting 
the  hierarchy  result  12.7.  So,  suppose  that  C c Mw  is  Ilj  with  a corresponding 
tree  U on  w x w as  given  by  13.1  so  that  for  any  x e mw, 

C(x)  **  Ux  is  well-founded  . 

By  12.9(b)  and  the  assumption  on  B,  it  follows  that 
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C(x)  **  3w(B(w)  A 3 /(/:  Ux  ->  Tw  is  an  order-preserving  map))  . 

But  the  3/  . . . can  be  rendered  in  L]  form  as 

31  vV0/V°y  (s/  e Ux  a s,-  e Ux  a s,-  d Sj 

Sy(i)  €=  rw  A S y(j)  £ 7W  A Sy(i)  Z)  ^y(j))  • 

Hence,  C is  L}.  H 

A direct  consequence  of  13.4  is  the  Borel  separability  of  analytic  sets  (Luzin 
[27:  50ff]):  If  X,  Y c k(‘°co)  are  disjoint  L[  sets,  there  is  a Borel  set  Z c k(0>w) 
such  that  X C Z and  Y D Z = 0.  This  in  turn  leads  directly  to  the  fundamental 
result  that  began  descriptive  set  theory: 

13.5  Exercise  (Suslin  [17]).  A set  is  Borel  iff  it  is  A}. 

Hint.  For  the  forward  direction,  show  that  the  Aj  subsets  of  k(0>co ) form  a a- 
algebra;  closure  under  countable  unions  follows  from  the  quantifier  equivalences 
(*)  before  12.4.  H 

This  is  essentially  the  first  published  proof  of  Suslin’ s result,  which  appeared 
in  Luzin-Sierpinski  [18].  Luzin-Sierpinski  [23]  started  with  a formulation  of  ana- 
lytic sets  differing  from  Suslin’ s,  one  that  essentially  regarded  reals  as  codes  for 
relations.  In  our  context,  any  x e r"o)  encodes  a binary  relation  Ex  on  o>  given  by 

(m,  n)  e Ex  iff  x((m , «))  = 0 . 

If  Ex  is  well-founded,  it  has  a rank  function  px  defined  recursively  on  co  by: 

Px(n)  — sup ({px(m)  + 1 | ( m , n)  e Ex})  , 

so  that  px(n)  = sup(0)  = 0 iff  n is  minimal  in  Ex  or  else  n is  not  in  its  field. 
(The  latter  is  allowed  for  the  convenience  of  having  dom(pA  ) = co.)  Define 

||v||  = sup ({px(n)  I n e co})  . 


Now  set 

WF  = {x  e OJco  | E_x  is  well-founded]  , and 
WF„  = [x  e WF  | ||x||  < a] 
for  a < co\.  WF  is  n\,  since  for  any  x e wco, 

x e WF  V1y3°m(x((y(m  + 1),  y(m)»  / 0)  . 

In  terms  of  13.1,  this  is  already  in  normal  form,  with  the  corresponding  tree  on 
co  x co  of  “unsecured”  sequences  being 

rWF  = {{s,  f>  e x mco)  | 

v0/  < \S\  - ma  + 1),  td))  < i^i  ->  s((td  + 1),  mm  = o)} . 
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For  .x  e WF,  ||x||  = ||7’J[WF||  and  so 

WF„  = {x  e a>ca  | is  well-founded  and  HT^H  < a } . 

The  following  is  now  a simple  consequence  of  the  argument  for  13.3  and  of  13.4: 

13.6  Exercise  (Luzin-Sierpinski  [23]). 

(a)  For  any  a < oo\,  WF„  is  Borel. 

(b)  If  B c WF  is  Ej . then  there  is  an  a < co\  such  that  B C WFa. 

(c)  WF  € 77j  — E[.  H 

Spector  [55]  established  an  effective  version  of  (b)  in  terms  of  well-orderings 
for  (lightface)  S\  sets  and  with  a>i  replaced  by 

cofK  = sup] ||x ||  | x e WF  A (the  graph  of)  x is  recursive]  . 

This  Xj1  boundedness  theorem  is  a cornerstone  of  hyperarithmetic  theory  (see 
Sacks  [90:  A§II]). 

(c)  provides  a natural  example  of  a member  of  n[  — E] . Note  that  with 
the  first  proof  of  13.4,  this  approach  to  n[  — 2}  ^ 0 does  not  depend  on  the 
diagonalization  argument  for  12.8.  By  13.1,  for  any  77j  (a)  set  A c k(fco)  there  is  a 
function  f:  k(‘°co)  ->  w(o  (whose  graph  is)  recursive  in  a such  that  A — /_1( WF). 
Flence,  WF  is  “complete”  for  this  sort  of  reducibility  for  the  IT}  sets.  Versions 
of  WF  and  (a)  had  already  occurred  in  Lebesgue  [05],  but  he  could  not  get  to  (c) 
because  of  a crucial  mistake,  discussed  earlier,  that  spurred  Suslin’s  discovery  of 
the  analytic  sets. 

Mazurkiewicz  [36]  provided  a more  mathematical  example  of  a member  of 
nj  — L}.  Since  a continuous  function:  R — > R is  determined  by  its  values  on  the 
rationals,  for  present  purposes  such  functions  can  be  identified  with  members  of 
10 a>  in  a systematic  way  much  as  codes  for  open  sets  were  developed  (see  before 
11.8).  Mazurkiewicz  showed  that 

{/:  R R | / is  differentiable]  e Ilj  — L[  . 

See  Becker  [92]  for  an  account  of  recent  work  along  these  lines. 

The  first  significant  result  about  the  second  level  of  the  projective  hierarchy 
was  the  following  extension  of  part  of  13.3: 

13.7  Proposition  (Sierpinski  [25]).  Every  set  is  a union  of  K|  Borel  sets. 

Two  proofs  are  provided;  the  first  is  the  original  one  and  uses  AC,  and  the 
second  (see  after  13.14)  does  not. 

First  proof  of  13.7.  Suppose  that  A C k(a>w)  is  L},  with  II}  B C ^f^co)  such 
that  for  any  w e k(‘°co), 

A(w)  **  3'yB(w,  _y)  . 

By  13.3,  B — where  each  Ca  is  Borel,  and  so 
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A(w)  **  3ly3a  < a>\ C„(w,  y) 

3a  < co\3lyCa{v<,  y)  . 

For  each  a < a>\,  Da  — {w  e k(fco)  | El'yCatw,  y)}  is  L[,  and  so  again  by  13.3 
Da  = Uf<(Ul  where  each  £“  is  Borel.  (The  choice  for  each  a < a>\  of  a 
family  {Zs|  | § < an)  uses  AC.)  Hence, 

A = LU*  Uf<a,  EaH  • H 

With  the  perfect  set  property  for  Borel  sets  (see  12.2(c))  Sierpinski  was  able 
to  conclude  that  every  £,  set  h as  cardinality  at  most  Ki  or  a perfect  subset  and 
hence  cardinality  2^°.  (This  result  is  improved  by  14.9  in  a modem  setting.) 

The  main  classical  results  about  nj  and  were  established  using  these 
ideas  (without  the  effective  content,  of  course),  and  Kondo’s  IlJ  Uniformization 
theorem  (see  13.17),  considered  difficult  for  some  time,  was  a culminating  result 
of  the  ordinal  analysis  of  IT}  sets.  13.7  was  to  become  paradigmatic  for  reducing 
the  analysis  of  projective  sets  to  unions  of  Borel  sets;  digressing  somewhat,  we 
briefly  take  a look  at  work  in  this  vein  in  the  post-Cohen  era: 

The  converse  to  13.7, 

any  union  of  B i Borel  sets  is  E1  , 

fails  in  ZFC  + CH,  since  any  set  of  reals  of  cardinality  Bi  is  a union  of  Ki  Borel 
sets,  namely  the  singletons  of  its  members.  Notwithstanding,  Luzin  [35]  consid- 
ered the  possibility  of  that  proposition  holding,  and  it  is  actually  a consequence 
of  ZFC  + MA  + -•CH  + 3a  e “tt>(&>f[fl]  = oq)  (Martin-Solovay  [70:  §3]).  This 
incidentally  showed  that  one  cannot  prove  in  ZFC  that  every  II)  set  is  a union  of 
Ki  Borel  sets.  Solovay  showed  that  it  is  consistent  to  have  2K°  large  and  a II)  set 
that  is  not  the  union  of  fewer  than  2K°  Borel  sets. 

Considering  arbitrary  sets  of  reals,  Levy-Solovay  [72]  first  observed  that  ZFC 
H — 'CH  implies  that  there  is  a set  of  reals  which  is  not  the  union  of  any  Borel 
sets,  and  went  on  to  show  that  some  substantial  use  of  AC  is  necessary:  They 
established  using  Solovay’s  Lebesgue  measure  arguments  that  Con(ZFC  + 3k(k 
is  inaccessible))  implies  Con(ZFC  H — 'CH  + every  “On-definable  set  of  reals  is 
a union  of  Ki  Borel  sets1)  (cf.  11.11),  and  passing  to  an  inner  model,  Con(DC  + 
a> i f 2K°  + ^every  set  of  reals  is  a union  of  Kj  Borel  sets1)  (cf.  11.1).  Finally,  in 
the  rather  different  setting  of  ZF  + AD,  the  Axiom  of  Determinacy,  the  converse 
to  13.7  does  hold,  and  moreover,  elegant  generalizations  through  the  projective 
hierarchy  have  been  obtained  (30.20). 

Regularity  Properties  in  L 

We  next  discuss  Godel’s  delimitative  results  with  V = L.  A conspicuous  feature 
of  the  modem  theory  is  the  study  of  reals  as  codes  for  countable  models.  Any 
.r  e encodes  the  binary  relation  Ex  — {( m,n ) \ x({m,n ))  = 0}  as  specified 
before,  and  consequently  a structure 
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Mx  = {co,  Ex) 

for  the  language  Ce  of  set  theory.  If  Mx  is  well-founded  and  extensional,  then 
by  the  Collapsing  Lemma  0.4  it  has  a unique  transitive  collapse  tr (Mx)  and  an 
isomorphism 

ttx:  Mx  tr (Mx)  . 

For  (p  a formula  and  s e <<0co, 

Mx  (= 

is  written  with  the  understanding  that  cp  has  at  most  the  variables  Vo,  . . . , fpi-i 
free  and  is  satisfied  in  Mx  with  the  variable  assignment  of  s(i)  to  v Temporarily 
specify  that  e co  is  the  Godel  number  of  formula  cp  in  some  fixed  recursive 
arithmetization  of  C€,  and  for  any  collection  0 of  formulas,  define  ^0^:  &>  ->  2 
by  ^0^{m)  — 1 iff  m — ^tp1  for  some  <p  e 0. 

13.8  Proposition. 

(a)  For  any  n e co, 

{( m , i,  x)  e 2 co  x 01  co  \ m = ^cp1'  A < p is  En  A Mx  \=  ^>[s,]} 

is  arithmetical,  in  fact  when  n > 0. 

(b)  For  any  collection  0 of  C€  sentences,  {x  e “ co  \ Mx  models  0}  is  A\{}0^). 

(c)  If  a e and  Ma  is  well-founded  and  extensional,  then 

{{ m , x)  e co  x M co  \ na{m)  — ,r} 


is  arithmetical  in  a. 

Proof,  (a)  and  (b)  follow  from  the  arithmetization  of  the  satisfaction  relation.  The 
first  can  be  established  inductively  following  the  definition  of  \="M  given  in  §0. 
The  second  recasts  the  AfF  definability  of  the  satisfaction  relation  for  set  structures 
(see  e.g.  Devlin  [84:41]  or  Drake  [74:92]). 

(c)  Using  (a)  and  the  definition  of  each  member  of  to  in  set  theory,  it  is  simple 
to  see  that 

{{Up)  e 2 oo  | it a (i ) = p} 

is  arithmetical  in  a.  Consequently,  if  cp  is  the  formula  (i>o,  iq)  e t>2,  then  for  any 
m e co  and  x e 01  co, 

tta{m)  — x<o- 

V° pW°q(x(p)  — q 3°n0j(7Ta(i)  — p A ita(j)  — q A Ma  \=  </>[i,  , 

which  shows  that  the  relation  is  arithmetical  in  a. 

It  can  be  checked  that  the  relation  in  (c)  is  n^ia). 

By  3.3(a),  there  is  a sentence  ero  such  that  for  any  transitive  set  model  N of 
(Tq,  N = Ls  for  some  limit  <5  > &>,  and  a formula  cpa{v\,  tb)  that  defines  in  L a 
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well-ordering  <L  of  L such  that  for  any  limit  S > co  and  x,  y e L$,  x <L  y iff 
L$  \=  <po[x,  y].  By  the  proof  of  CH  in  L the  well-ordering 

2Ca>)  n <L  — {{x,  y)  I x,  y e ma>  n L Ax  <L  y] 

has  ordertype  of. 

13.9  Theorem  (Godel  [51:  67]).  <°co  n L and  2(a>co)  fl  <l  are 
Proof.  Proceeding  directly  to  the  well-ordering,  for  any  x,  y e mbo  fl  L, 

x <l  y z3°  p3°q(Mz  is  well-founded  and  extensional 

A n z(p)  - x A 7i z(q)  = y A Mz  [=  a0  A (p0[p,  q\)  . 


By  13.8  this  is  E\.  H 

13.10  Corollary.  If  Ww  C L,  then  2(‘°a> ) fl  <L  is  a A\  set  which  is  not  Lebesgue 
measurable  and  does  not  have  the  Baire  property. 

Proof.  With  OJoj  C L,  2(wa> ) fl  <L  is  a well-ordering  of  all  the  reals,  and  so  it 
must  also  be  A\,  since  for  any  x,  y e wco, 

x <L  y **  x f y A ->(y  <L  x)  . 

Assume  now  that  2(wco)  fl  <L  is  Lebesgue  measurable.  For  any  y e wa>C 1 L, 
[x  e wa>  fl  L | x <l  y}  is  countable  and  hence  null,  and  so  2(coa> ) fl  <l  is  null 
by  Fubini’s  Theorem  0.10.  However,  the  same  argument  could  be  applied  to  its 
complement  {(x,  y)  e 2(‘°co)  flL|y<LxVx  = y}  to  show  that  it  too  must 
be  null,  reaching  a contradiction.  An  analogous  argument  using  the  Kuratowski- 
Ulam  Theorem  0.13  shows  that  2(‘°ol>)  n <L  does  not  have  the  Baire  property. 

H 


Godel  [38]  announced  the  Lebesgue  measure  result;  the  Baire  property  result 
seems  to  be  first  explicit  in  Mycielski  [64: 216].  The  following  refinement  of  13.9, 
establishing  in  the  parlance  that  in  L,  2("tt>)  fl  <L  is  a S\-good  well-ordering, 
led  to  the  further  results  of  descriptive  set  theory  in  L. 

13.11  Exercise  (Addison  [59]).  The  relation  ISl  C 2C"o>)  given  by 
1Sl(x,  y)  {(x),  | i e co}  = [z  e m(o  | z <l  >’} 

is  S\. 

Hint.  This  is  similar  to  13.9.  H 

The  delimitative  result  for  the  perfect  set  property  is  more  involved.  Godel 
never  published  a proof,  and  those  published  have  relied  on  Ilj  Uniformization. 
He  once  described  to  Levy  the  following  incisive  argument,  one  that  has  a wider 
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applicability  (20.21);  Solovay  also  came  to  essentially  the  same  argument  (cf.  Sil- 
ver [71b:  440]).  It  can  be  rendered  with  just  the  3.3(a)  properties  of  cto  and  cpo 
reviewed  before  13.8. 

13.12  Theorem.  Suppose  that  co[  — u>\.  Then  there  is  a 77*  set  of  reals  without 
the  perfect  set  property. 

Proof.  The  idea  is  to  get  a 77/  set  of  unique  codes  for  ordinals  less  than  co\ . For 
such  an  ordinal  a,  there  is  an  xq  e wco  fl  L such  that  MXo  is  a well-ordered  set 
with  ordertype  a.  To  specify  the  <l -least  such  xq  is  not  77/,  but  it  is  77/  to  say 
of  some  x\  e mco  n L:  MXl  = (L$,  e)  for  the  least  limit  ordinal  S > co  such  that 
some  xo  e Ls  is  the  <l\(Ls  x L^-least  possible  as  above.  Again,  to  specify  the 
<l -least  such  x\  is  not  77/,  but  it  can  be  similarly  specified  in  terms  of  some  MXl, 
and  so  forth.  The  resulting  (x,  | i e co)  can  then  serve  as  a unique  code  for  a. 

For  k e w and  x e “ co,  let  Mx\k  be  the  initial  segment  of  Mx  determined  by 
k ic  set 

Dx.k  = {n  | x(\n,  k))  = 0}  , and 
Mx\k  = {DXtk,  Ex\(Dx  k x Dxf))  . 

Recall  that  each  x e encodes  a countable  sequence  {(x),-  \ i e co)  with  each 
(x)i  e 01  oo.  The  description  above  then  leads  to  the  following  relation  on  “&>: 

A(x)  **  M(X)0  is  well-ordered  A V°/[M(x).+1  is  well-founded 
and  extensional  A A7(x)i+I  ^ <t0  a (x),-  e tr(A7(x).+1) 

A V°k(M(x)i+l \k  |=  cr0  -»  (x),-  i tr(MWi+1  \k)) 

A VlyV°pV°q[7TMi+l(p)  = y A 7t(x)i+l(q)  — (x),- 
A M(x)l+l\=  <po[p,q]  ->  My  ft  Mixh]\  . 

That  My  ^ M(V).  can  be  cast  as  a 77/  assertion  V1 /(/  does  not  code  an  iso- 
morphism: My  = M(x)j)  (cf.  the  second  proof  of  13.4),  and  so  applications  of 
13.8(a)(c)  and  a quantifier  analysis  show  that  A is  77/. 

If  A(x),  there  is  a corresponding  sequence  [or-  \ i e co)  of  ordinals  less  than 
oo\  such  that,  using  the  3.3(a)  properties  of  oq  and  <po, 

(a)  M(x)0  has  ordertype  af  and  for  each  i e co,  M{x)j+l  = (La*+],  e) 
and  af+]  is  the  least  limit  ordinal  > co  such  that  (x),  e Lay-+[  . 

(b)  For  each  i e co,  if  y <k  (x),-  (in  Laj+i  and  hence  in  L ),  then  My  ^ A7(x)i  . 

Moreover,  it  is  simple  to  see  that  for  any  a < cof  there  is  a unique  x e wco 
such  that  A(x)  and  = a.  Hence,  A is  a 77/  set  of  cardinality  \co\\. 

A cannot  have  a perfect  subset;  in  fact,  any  subset  B of  A is  countable: 
7 V — {(x)o  | x e 71}  is  also  L] , and  in  terms  of  13.6,  7 V c WF  so  that  B'  c WF^ 
for  some  ft  < oo\.  This  means  that  ||(x)o||  = a/ [ < for  x e B,  and  so  by  the 
uniqueness  of  the  coding,  |7?|  < \f}\. 

Finally,  the  assumption  co\  — co\  implies  that  A is  uncountable,  and  so  the 
proof  is  complete.  H 
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Guaspari  [73],  Kechris  [75],  and  Sacks  [76]  independently  showed  that 

C\  — [x  e “a>  | x e } , 

where  a>\  is  the  least  ordinal  not  recursive  in  x (i.e.  not  the  ordertype  of  any 
well-ordering  C 2(“&>)  recursive  in  x),  is  also  a 77/  set  of  cardinality  \(o\\  without 
a perfect  subset.  Moreover,  they  showed  that  any  n\  set  without  a perfect  subset 
is  a subset  of  C\,  and  provided  detailed  analyses  which  are  generalized  through 
the  projective  hierarchy  by  Kechris  [75]  under  strong  hypotheses. 

For  any  a e wa>,  L[a ] has  a canonical  well-ordering  <l[u]  with  properties 
analogous  to  </..  Hence  13.8-13.12  all  relativize  to  produce  corresponding  results 
about  L[a]  and  n\(a),  A\{a),  and  X'j  (a)  sets.  Moreover,  this  relativization  is 
uniform,  so  that  for  example  {(x,  a)  e 2C°&>)  | x e L[fl]}  is  S). 

Analysis  of  Sets 

A basic  tree  representation  of  sets  implicit  in  Shoenfield  [61]  provided  the 
backdrop  for  the  analysis  of  their  regularity  properties.  As  this  foreshadowed 
the  development  of  similar  representations  of  all  projective  sets  under  strong  hy- 
potheses, a general  setting  is  developed.  Recall  that  the  projection  of  a set  B is 
pB  — [x  | 3y((x,  y)  e B)j.  For  A c k(Ma))  and  set  Y, 

A is  Y-Suslin  iff  there  is  a tree  T on  kco  x Y such  that  A = p[T ] . 

This  last  can  be  further  expressed  in  our  tree  terminology  by:  For  w e k(0>co), 

A(w)  [rw]  / 0 

T„  is  ill-founded  . 

In  terms  of  a natural  topology  imposed  on  k(0)co)  x aY,  a subset  C is  closed  iff 
there  is  a tree  on  ka>x  Y such  that  C — [T]  (cf.  the  proof  of  12.10).  Thus,  being  Y - 
Suslin  is  a generalization  of  being  analytic.  While  it  may  be  necessary  to  replace 
w by  other  T,  such  representations  turn  out  to  have  significant  consequences  (see 
e.g.  §30),  and  so  in  anticipation, 

A is  Suslin  iff  A is  Y -Suslin  for  some  set  T . 

The  following  serves  to  frame  the  discussion,  with  all  but  (c)  and  (d)  making 
substantial  uses  of  AC. 

13.13  Proposition.  Suppose  that  A C k(":aj).  Then: 

(a)  A is  Y-Suslin  iff  A is  \ Y\-Suslin. 

(b)  A is  2**° -Suslin. 

(c)  A is  co-Suslin  iff  A is  £ | . 

(d)  If  k > 1 and  A is  k -Suslin,  then  pA  is  also  k -Suslin.  Consequently,  if 
n e co  and  eveiy  II*  set  is  K-Suslin,  then  so  is  every  £*+1  set. 

(e)  If  A|  is  a K-Suslin  subset  of  k (fed)  for  each  £ < k,  then  so  is  Uf<k- 

(f)  (Martin)  For  any  n > 0,  A is  a)„-Siislin  iff  A is  a union  of  Borel  sets. 
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(g)  Suppose  that  B C 1 (coa>)  is  Y-Suslin,  /:  k(°>co)  — > 1 (ojod)  is  continuous,  and 
A — Then  A is  Y-Suslin. 

Proof,  (a)  Only  the  cardinality  of  Y matters;  a bijection:  Y — >-  |F|  induces  an 
isomorphism  between  the  corresponding  trees. 

(b)  Assuming  that  A ^ 0,  let  A — | § < 2K°).  Then  if 

T — {(a||m,  m x {£}}  | m e a>}  , 


it  follows  that  A = p[T ]. 

(c)  This  follows  from  12.10  and  the  definition  of  analytic  sets;  cf.  13.1. 

(d)  If  a tree  T on  kco  x k is  such  that  A = p[T],  then  that  tree  recast  as  T' 
on  k~xco  x (oj  x k)  is  such  that  p A — p[T'].  The  result  now  follows  as  for  (a)  by 
applying  a bijection:  co  x k ->  k. 

(e)  Suppose  that  for  each  f < k there  is  a tree  on  kco  x k such  that 
Aj:  — p[T^\.  For  z £“«■  define  z+  e 01  k by:  z+(i)  = z(i  + 1).  Let  T be  that  tree 
on  ka>  x k such  that 


[T]  = {<w,  y)  | (w,  y+>  e r>-(0)}  . 

Then 

w e U|<K^  < /r3v((w,  y)  e [T?]) 

3z((w,  z+)  e [rz(0)]) 

3z((w,  z)  e [T])  . 

(f)  For  one  direction,  (c)  and  (e)  imply  that  a union  of  k Borel  sets  is  /r-Suslin. 

For  the  converse,  proceed  by  induction  on  n:  Suppose  that  T is  a tree  on  ka>  x &>„ 
with  n > 0 and  A = p[T\.  For  each  y < &>„,  let  TY  — T (T  x <ay).  Then 

A = 1J Y«o„  P\TY]  since  cf(«„)  > co.  But  by  (a),  each  p[Ty\  is  <u„_i-Suslin. 
Consequently,  if  n = 1 then  by  (c)  and  13.3,  p[TY]  is  a union  of  Ki  Borel  sets, 
and  if  n > 1,  then  p\l  yl{  is  a union  of  K„_  i Borel  sets  by  induction.  In  either 
case,  it  follows  that  A is  a union  of  K„  Borel  sets. 

(g)  Take  k — l — 1 for  notational  simplicity.  By  the  continuity  of  /,  for  any 
t e <aco  there  is  an  s e <wa>  such  that  O(s)  c f~1(0(t)).  A sort  of  converse  is 
needed,  so  first  note  that  for  any  s e <0’o>, 

Ws  = {t  e <<0(»  | OWcf'COW)} 

is  nonempty  as  it  contains  0,  and  for  t,  T e Ws,  either  t c t or  1 c t.  So,  let  6(s) 
be  the  largest  t e Ws  satisfying  |?|  < |s|. 

For  any  x e 

f(X)  = ifmzJW™)  ■ 

The  3 direction  follows  directly  from  the  definition  of  9.  For  the  C direction, 
suppose  that  n e co.  Since  / is  continuous  and  .r  e f~l(0(f(x)\n)),  there  must 
be  an  m e co  such  that  0{x\m)  C f~x(0(f(x)\n)).  Taking  such  an  in  > n,  it 
follows  that  f(x)\n  c 0(x\m). 
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Suppose  now  that  B = p[T\  with  T a tree  on  a>  x Y,  and  let  S be  a tree  on 
a)  x Y defined  by: 

S = {(i,  u)  | Is | = \u\  A (6>(s),m||6>(s)|)  e T]  . 

Then  for  any  x e 

xep[S]  3^“f((x,g)€[S]) 

3 g e wYV°m({9(x\m),  g\ \6(x \m)\)  e T) 

^ 3 geaY((f(x),g)e[T]) 
f(x)  e p[T ] 

« .t  e A . H 

With  ZF  the  ambient  theory  the  uses  of  AC  can  be  made  explicit  by  adding 
hypotheses,  e.g.  that  Y is  well-orderable  in  (a)  and  that  the  choice  of  trees 
( T * | £ < k)  is  given  beforehand  in  (e).  (f)  is  a ZFC  construction  principle 
which  will  be  applied  later  (15.13).  (b)  is  merely  a foil;  the  main  interest  is  in 
Suslin  representations  established  in  ZF  with  trees  having  definability  and  homo- 
geneity properties  (as  will  become  clearer  in  §15).  With  (c)  as  the  starting  point 
Shoenfield  provided  such  a representation  for  Y}2  sets: 

13.14  Theorem  (Shoenfield  [61]).  Every  set  is  o)\-Suslin.  In  fact,  if  a e wco 
and  A c *■  (“&>)  is  (a),  there  is  a tree  U e L[a ] on  kco  x u>\  such  that  A = p[U\. 

Proof.  The  result  is  first  established  assuming  that  A is  n\  (a).  Taking  k = 1 for 
notational  simplicity,  by  13.1  there  is  a tree  T e L[a\  on  a>  x a>  such  that  for  any 

x e a,co, 

A(x)  **  Tx  is  well-founded  . 

Consequently, 

A(x)  **  3g(g:  Tx  — >■  co\  is  an  order-preserving  map) 

by  12.9(a). 

This  can  be  recast  in  terms  of  getting  an  infinite  branch  through  a tree. 
Remembering  that  there  is  a fixed  recursive  enumeration  (s,-  i £ of  of  <0,w  with 
I s,- 1 < i,  define  a tree  T on  co  x co\  by 

T = {(5,  u)  | Vi,  j < \s\iSj  D Sj  A (i||s, I,  Si)  e T ->  u(i ) < k (;'))}  . 

It  is  straightforward  to  see  that  T e L[a],  being  definable  from  T and  u>\,  and 
that  for  any  x e “<o, 


A(x)  ^ 3 ge%((x,?)e[f]) 

**  x e p[f\  . 
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The  full  result  for  H\{a)  sets  now  follows  by  the  argument  for  13.13(d), 
using  a bijection:  co  x co\  **  co\  in  L[a]  at  one  point.  H 

The  w\  of  this  result  is  emphatically  in  the  sense  of  V and  could  well  differ 
from  co^a\  Note  that  in  the  full  AC  context  the  converse, 

any  wi-Suslin  set  is  > 

is  independent  by  the  discussion  (after  13.7)  of  the  proposition  “any  union  of  Ki 
Borel  sets  is  E,”  ar*d  the  connecting  13.13(f). 

The  13.14  representation  leads  to  a proof  of  the  classical  13.7  avoiding  AC: 

Second  Proof  of  13.7.  Suppose  that  A C k(mco)  is  Ej,  and  let  U be  a tree  on 
ka>  x co i such  that  A — p[U ].  For  each  y < u>\  let  UY  = U fi  ftf^co)  x <toy), 
so  that  A — P\UY ] as  cf(<«i)  > co.  This  follows  the  proof  of  13.13(f);  by 

other  parts  of  13.13  not  dependent  on  AC,  each  p[Uy ] is  w-Suslin  and  hence  L] . 
In  any  case,  we  can  now  proceed  as  in  the  proof  of  13.3  to  show,  uniformly  in  y, 
that  for  any  a < co\, 

Cy  — { we  k(°’co)  | {Uy  is  well-founded  and  ||f/£||  < a)  v 

3.?  e <wco\(UY/s  is  well-founded  and  ||f/^/i||  = a)} 

is  Borel,  and  arguing  as  in  that  proof  but  in  terms  of  complements,  p[Uy]  — 
U«<(B1  (*("")  - c«)-  Consequently, 


Shoenfield  established  an  important  absoluteness  result  with  this  analysis  of 
Li,  sets.  To  discuss  the  absoluteness  of  analytical  relations,  first  associate  to  each 
analytical  A a definite  normal  form  as  given  by  12.4.  For  instance,  if  A C 01  eo  is 
, then  fix  a recursive  R c 4co  such  that  for  any  x e 10 co, 

A(x)  31yV1z3°mf?(m,  x(m),  y(m),  z(m))  . 

Let  M be  a transitive  class  such  that  for  any  r e co  and  recursive  S c ' co,  S e M; 
this  certainly  holds  if  M is  an  e-model  of  ZF.  Then  A M denotes  the  relativization 
of  (the  normal  form  of)  A to  M.  For  the  A above  and  x e mco  fl  M, 

AM (x)  (3y  e fl  M)(Vz  e wco  fl  M)B°mR(m,  x(m),  y(m),  z(m))  . 

Finally, 

A is  absolute  for  M iff  for  any  u e M,  A(u)  -e>  AM(u)  . 

For  any  a e wco,  these  concepts  relativize  directly  to  relations  analytical  in  a and  M 
such  that  a e M.  Here  is  Shoenfield’s  Absoluteness  Theorem , parsimoniously 
stated;  as  before,  &>i  is  in  the  sense  of  V. 
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13.15  Theorem  (Shoenfield  [61]  ).  There  is  a theorem  o of  ZF  such  that  for  any 
a e w(x>  and  M a transitive  E-model  of  <r  with  co\  U [a]  C M,  every  Hj(a)  (and 
hence  n\(a))  relation  is  absolute  for  M. 

Proof.  The  proof  is  carried  out  assuming  that  M is  an  inner  model  of  ZFC 
with  a e M,  the  necessary  refinements  articulated  later.  For  notational  simplicity 
consider  a H\(a)  relation  A C ® co . 

First,  let  T be  a tree  on  2co  x co  such  that  for  any  x e “tw, 

A(x ) **  3ly(fXty)  is  well-founded)  , 

where  by  13.1  it  can  be  assumed  that  TM  — T . Now  if  x e M and  AM (x),  there 
is  a y e M such  that  fx,y)  is  well-founded  in  the  sense  of  M.  It  follows  by 
12.9(a)  that  in  M there  is  an  order-preserving  map:  7]AA).)  ->  On.  Of  course,  the 
map  also  exists  in  V,  so  A(x)  holds. 

The  converse  uses  the  13.14  representation  of  S](a)  sets.  Let  U e L\a\  c M 
be  a tree  on  co  x oo\  such  that  for  any  x e w(o, 

A(x)  Ux  is  ill-founded  . 

Now  if  x e M and  A(x),  the  ill-foundedness  of  Ux  in  V implies  by  12.9(a)  that 
there  is  no  order-preserving  map:  Ux  — > o)\ , and  so  no  such  map  can  exist  in  M 
either.  Thus,  Ux  is  ill-founded  in  the  sense  of  M,  and  using  13.14  there  AM(x) 
holds. 

To  complete  the  proof,  note  first  that  M need  only  model  a sufficiently  strong 
theorem  d of  ZF.  Neither  the  uses  of  12.9(a)  nor  of  13.14  require  DC,  as  it  is 
not  needed  in  the  equivalence  between  well-foundedness  and  the  lack  of  infinite 
descending  chains  for  well-orderable  trees.  Also,  it  can  be  checked  that  U need 
only  be  a definable  class  in  M,  for  which  <x>\  c M suffices.  H 

The  mathematical  significance  of  this  result  lies  in  the  fact  that  many  propo- 
sitions of  classical  analysis  are  and  so  if  they  can  be  established  assuming  for 
example  V — L (and  hence  CH),  then  they  can  be  established  in  ZF  alone! 

The  result  is  the  best  possible  in  the  projective  hierarchy,  since  3lx(x  L) 
is  and  cannot  be  absolute  for  M — L.  Inner  models  involving  large  cardinals 
were  to  be  devised  that  possess  stronger  absoluteness  properties  (see  volume  II). 

13.16  Corollary.  If  r e co  and  A C ra>  is  then  A e L[fl]. 

Proof.  Although  we  have  been  working  with  tree  representations  of  subsets  of 
k(a>co),  the  preceding  argument  works  with  only  notational  changes  for  A C ' co. 
Consequently,  = A IT  L[a~\  — A,  so  A e L[a],  H 

The  idea  behind  13.14  also  provides  a straightforward  approach  to  the  proof 
of  Kondo’s  II]  Uniformization  Theorem  12.3,  which  as  Addison  observed  can  be 
effectivized. 
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13.17  Theorem  (Kondo  [37,39];  Addison).  For  a e “Jw,  any  (a)  subset  of 
2(“&>)  can  be  uniformized  by  a n\(a)  set. 

Proof.  Suppose  that  A C 2(M&>)  is  n\{a).  The  task  is  to  find  a 77j  (a)  set  Aq  C.  A 
such  that  for  any  x e wa>,  if  there  is  a y such  that  A(x,  y),  then  there  is  exactly 
one  y such  that  A0(x,  y).  Let  T be  a tree  over  2a>  x a>  as  in  13.1  such  that 

A(x,  y)  **  Tx  y is  well-founded  , 

where  Tx  y is  written  for  T{x,y)  from  now  on  for  the  sake  of  convenience.  The 
argument  for  13.14  provides  a corresponding  tree  T on  2co  x co \ such  that 

Mx,y)  3g  e w(oi({x,y,  g)  e [f])  , 

where  g provides  an  order-preserving  map:  Tx<y  ->  co\  verifying  the  well- 
foundedness  of  Tx  y. 

For  any  x,  y e w(o,  if  there  is  a g such  that  (x,  y,  g)  e [ T]  at  all,  then  as  for 
any  ill-founded  tree,  there  is  a lexicographically  least  (“leftmost”)  one,  i.e.  a go 
such  that  for  any  other  g,  if  i is  least  such  that  god)  f g(i),  then  go  O’)  < gO). 
However,  for  T there  is  an  “honest  leftmost”  one,  i.e.  a gxy  such  that  for  any 
other  g,  gx,y(i)  < g(i)  for  every  i e co:  By  12.9(a)  and  the  definition  of  T,  we 
can  take 

[ Ptx  y (s,- ) if  S;  e Txy  , and 
8x,y  (0=j 

l 0 otherwise  . 

Next,  for  any  x e 10 co,  suppose  that  there  is  a y such  that  A(x,  y),  i.e.  a pair 
(y,  g)  such  that  (x,  y,  g)  e [T],  Then  there  is  a lexicographically  least  (“leftmost”) 
one,  i.e.  a pair  (yx,  gx)  such  that  for  any  other  pair  (y,  g),  if  i is  least  such  that 
(yxd),  gxd))  / (y(/'),gO)>,  then  either  yx(i)  < y(i ) or  else  yx(i)  = y(i ) and 
gx(i)  < g(i).  The  uniformization  is  carried  out  by  choosing  yx.  That  this  can 
be  done  in  a n\  (a)  fashion  relies  on  the  observation  that  gx  must  be  the  honest 
leftmost  gx,yx. 

Turning  to  formalities,  because  of  how  honest  leftmost  branches  can  be  de- 
fined let  R-  and  R be  relations  on  co  x 3 ("‘co)  given  by 

R-(i,x,y,z ) Tx.z /si  is  ill-founded  v PrKJs,)  < P'/„(s,)  , and 

R<(i,x,y,z ) TX'Z/Sj  is  ill-founded  v p7y}Ts,)  < prxz(s,)  . 

Remembering  that  pr(t)  — \\T /t\\  for  well-founded  trees  T and  t e T,  by  12.9, 

R-(i,x,y,z ) **  3f(f:TXiy/Sj  — >•  TXtZ/Si  is  order-preserving)  , and 
R<(i,x,y,z ) **  3f3°k(f:  Tx^y/Sj  ->  Txz/s''  (k)  is  order-preserving)  . 

Since  by  13.1,  {(i,x,y)  \ s,-  e Tv,v}  is  recursive  in  a,  R-  and  R<  are  thus  seen 
to  be  S\  (a)  (cf.  the  second  proof  of  13.4). 
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Now  define  A0  c 2(®<y)  by 
Ao(x,y)  **  A(x,  y)  A V1zV°m[(y(m)  = z(m) 

A V0/  < m(R~(i,  x,  y,  z)  A R-(i,x,z,y )) 

—>  (y(m)  < z(m)  v (y(m)  — z(m)  A ~'R<{m,  x,  z,  y)))]  ■ 

This  is  essentially  of  form  77*  (a)  a V'^/ta)  ->  -^/(a)],  and  hence  77/(0).  To 
decipher  it,  note  that  R-(i,  x,  y,  z)  A R-(i,  x,  z,  y)  is  equivalent  to 

( TXty/Sj  and  TXJ s,-  are  both  ill-founded  v PTxy(s,)  = Prx,z(Si))  . 

~'R<(m,  x,  z,  y)  is  equivalent  to 

(Tx,y  is  well-founded  A pTxz( s,„)  f pTxy{ sm))  , 

which  in  turn  is  equivalent  to 

(Tx,y  is  well-founded  A pTxy{ sm)  < PrX7(sm))  . 

Hence,  for  any  x e 01  co,  through  honest  leftmost  branches  Aq(x,  y)  does  indeed 
pick  out  that  y of  the  leftmost  pair  { y , g)  such  that  (x,  y,  g)  e [71]  if  there  is  one, 
and  so  the  proof  is  complete.  H 

Elements  of  this  proof,  particularly  the  simple  definability  of  the  honest  left- 
most branch,  were  later  abstracted  in  the  concept  of  a scale  (cf.  29.2).  Adjoining 
an  31  to  the  result  is  straightforward: 

13.18  Exercise  (Kondo  [39]).  For  a e “ a>,  any  X’21(«)  subset  of  2(ma>)  can  be 

uniformized  by  a S]  (a)  set.  H 

This  is  the  limit  of  directly  derivable  results  about  uniformization.  Addison 
settled  the  issue  in  L: 

13.19  Exercise  (Addison  [59]).  If  “ co  c L,  then  for  any  a ewco  and  2 < n e co, 
every  -S’,!  (a)  subset  of  2 (fed)  can  be  uniformized  by  a X’)|  (a)  set. 

Hint.  Note  that  if  A c 2(wa>)  is  /7^_ , (a),  then  in  terms  of  13.11, 

Ao(x,  y)  **  A(x,y)  A 31m(ISz.(m,  y)  A V°i— >A(x,  (m),)) 

is  Fl  (a),  and  3 yA(x,  y)  3!yAo(.r,  y).  H 

However,  Levy  [65a]  observed  that  if  one  adjoins  a Cohen  real  to  L then  in  the 
resulting  extension  there  is  a 77/  set  that  cannot  be  uniformized  by  any  analytical 
set  (see  also  Levy  [70]).  Moreover,  if  one  adjoins  uncountably  many  Cohen  reals 
to  L,  then  in  the  resulting  extension  there  is  a 77/  set  (namely  {{x,  y)  \ y $ L[x]}) 
that  cannot  be  uniformized  by  any  projective  set.  Strong  hypotheses  were  later  to 
settle  these  issues  in  different  ways  (15.14,29.9);  in  fact,  they  breathed  new  life 
into  the  study  of  structural  properties  of  classical  vintage  (§§28,29). 
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14.  Lj  Sets  and  Sharps 

In  1965  Solovay  reactivated  the  classical  program  of  investigating  the  extent  of 
the  regularity  properties  by  providing  characterizations  for  the  sets,  the  level  of 
Godel’s  delimitative  results.  He  also  investigated  the  definability  of  the  transcen- 
dent 0#  and  made  several  related  conjectures  bearing  on  the  scope  of  the  forcing 
method  that  inspired  new  avenues  of  research.  This  section  features  these  devel- 
opments, which  served  to  establish  the  relevance  of  large  cardinal  hypotheses  to 
the  study  of  the  projective  hierarchy. 

Building  on  his  Lebesgue  result  11.1,  Solovay  used  forcing  to  establish  char- 
acterizations for  the  measurability  of  the  Lj  sets.  Through  this  and  other  results, 
forcing  soon  became  established  as  a basic  construction  technique  within  set  the- 
ory. 

14.1  Theorem  (Solovay).  The  following  are  equivalent  for  a e coco: 

(a)  Every  (a)  set  of  reals  is  Lebesgue  measurable. 

(b)  UMc  I c e L[a\  is  a Gs  code  for  a null  set}  is  null. 

(c)  {x  | x is  not  random  over  L[a]}  is  null. 

(d)  Every  set  of form  {x  e cow  | L[a][x]  |=  (p[p , x]}  for  some  formula  cp  and 
p e /,[«]  is  Lebesgue  measurable. 

Proof.  By  11.10,  (b)  and  (c)  are  synoptic  and  imply  (d)  by  the  argument  for  11.11 
with  V replaced  by  L.  (d)  — > (a)  follows  from  absoluteness  13.15. 

Finally,  for  (a)  ->  (b),  let  B be  the  set  described  in  (b).  If  C c 2(fco)  is 
given  by 

C(x,  c)  **  c is  a Gs  code  for  a null  set  A ieA(, 
then  it  is  straightforward  to  check  that  C is  arithmetical.  Since 

x e B **  3'c(c  e L[a~\  a C(x,c))  , 

B is  E\(a)  by  13.9  relativized.  B is  therefore  measurable  by  assumption,  and 
it  remains  to  verify  that  it  is  null.  For  this  purpose,  an  elaboration  of  the  well- 
ordering argument  for  13.10  is  used: 

First  define  D c 2(“&>)  by 

D(x,c)  **  c e I[o]  A C(x,c)  A Wld(d  <L[a]  c —•C(x,d))  . 

Then  define  R c 2(“<y)  by 

R(x,  y)  o 31c3ld{D(x,  c)  A D(y,d)  A (c  <z.[«]  d V c — d))  . 

Thus,  R orders  the  x’s  in  B according  to  the  <l[o\ -least  Gs  code  c for  a null  set 
such  that  x e Ac.  R is  a prewellordering , i.e.  a well-ordering  except  that  there 
may  be  distinct  x and  y such  that  R(x,  y ) and  R(y,  x). 

By  13.11  relativized  D is  X'j  (a),  and  so  also  is  R.  Hence,  R is  measur- 
able by  assumption.  Since  2(wa>)  fl  <l[o\  has  ordertype  < a>\,  for  any  y. 
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Ry  — [x  | R(x,  y)}  is  null,  being  a countable  union  of  null  sets.  Thus,  by  Fubini’s 
Theorem  0.10,  R is  null.  However,  the  same  argument  can  be  applied  to  the 
complement  {(x,  y)  e B x B \ ~'R(x,y)},  a measurable  set  since  B is,  to  show 
that  it  too  must  be  null.  But  then,  B x B is  null,  and  so  also  must  be  B.  H 

An  analogous  argument  based  on  remarks  in  the  proof  of  11.11  about  the 
Baire  property  establishes  the  following: 

14.2  Theorem  (Solovay).  The  following  are  equivalent  for  a e “ co : 

(a)  Every  (a)  set  of  reals  has  the  Baire  property. 

(b)  [J[AC  | c e L[a]  is  an  Fa$  code  for  a meager  set}  is  meager. 

(c)  {x  | x not  a Cohen  real  over  L[u]}  is  meager. 

(d)  Every  set  of  form  {x  e “co  | L[a][x]  \=  cp[p,  x]}  for  some  formula  <p  and 

p e L[a]  has  the  Baire  property.  H 

14.3  Corollary.  If  co\^  < a>  i,  then  every  IT,1  (a)  set  of  reals  is  Lebesgue  measur- 
able and  has  the  Baire  property.  In  particular,  if  there  is  a measurable  cardinal, 
then  every  Lj  set  of  reals  is  Lebesgue  measurable  and  has  the  Baire  property. 

Proof.  If  “fflflL[a]  is  countable,  then  the  (b)’s  of  the  theorems  obtain.  The  second 
assertion  follows  from  the  first,  since  by  9.19,  co \ is  inaccessible  in  L[fl]  for  every 
a e “co.  H 


Solovay’s  ideas  were  recast  by  Moschovakis  [80:  8G]  to  provide  a forcing- 
free  proof  of  this  corollary.  While  the  statement  about  measurable  cardinals  is 
not  very  sharp,  it  is  a direct  implication  between  classical  concepts  that  provided 
a counterweight  to  Gddel’s  V — L delimitations  for  definable  sets  of  reals.  In 
this  way,  Solovay  established  that  although  measurable  cardinals  do  not  decide  the 
Continuum  Hypothesis,  they  do  have  a direct  mathematical  effect  on  the  reals.  On 
the  other  hand,  Silver  was  to  show  that  even  measurable  cardinals  do  not  entail  the 
Lebesgue  measurability  of  A\  sets  (20.19).  One  of  the  prominent  themes  of  the 
pursuit  of  stronger  hypotheses  is  the  measurability  of  more  and  more  projective 
sets. 

The  consistency  of  ZFC  plus  the  proposition  that  every  Li  set  is  Lebesgue 
measurable  and  has  the  Baire  property  does  not  require  large  cardinal  hypotheses. 
Martin- Solovay  [70:  §4]  showed  that  it  follows  from  Martin’s  Axiom  (MA)  + 
--CH,  and  moreover,  by  an  co\ -length  iteration  of  the  forcing  described  there,  it 
can  be  made  consistent  with  CH.  On  the  other  hand,  Shelah  [84]  was  to  show  that 
the  Lebesgue  measurability  of  every  Lj  set  does  require  the  consistency  strength 
of  an  inaccessible  cardinal  (see  volume  II),  complementing  Solovay’s  result  11.1. 

Solovay  also  established  outright  the  Lebesgue  measurability  of  sets  in  a 
substantial  subclass  of  A,.  To  describe  this  class,  we  have  to  look  more  carefully 
at  the  defining  formulas  for  relations. 

For  a e “co  and  A C “co,  A is  provably  A\(a)  iff  there  are  A2  formulas 
rjr(v },  i>2,  u],  uj)  and  <p(v J,  v^,  vj,  vj)  without  function  quantifiers  such  that  for 
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any  x e coco,  A(x)  iff  A2  \=  31  v\ty[x , a]  iff  A2  f=  'ilv\^lv\tp[x,d\,  and 

moreover,  provable  in  ZFC  is: 

A2  1=  **  V'd \llv\tp)  . 

(The  definition  extends  analogously  to  A c 'co  x k{u>co)  and  also  to  the  correspond- 
ing concepts provably  A\  and  provably  Aj.)  Thus,  A is  A\(a)  in  the  strong  sense 
that  there  are  S\{a)  and  n\{a)  formulations  whose  equivalence  is  a theorem  of 
ZFC.  For  example,  V — L implies  that  2(®co)  fl  <l  is  A\,  but  it  is  not  provably 
A\.  By  introducing  vacuous  quantifiers,  every  S\(a)  relation  is  provably  A](a). 
Andrei  Kolmogorov  [28]  introduced  the  R-sets  by  generalizing  the  Operation  (A) 
of  Suslin  [17]  into  a transfinite  hierarchy,  and  showed  that  they  are  Lebesgue  mea- 
surable. Recently  John  Burgess  [82,  83]  and  Rana  Barua  [84]  have  redeveloped 
and  extended  the  theory  of  these  sets.  It  can  be  shown  that  the  R-sets  are  provably 
A\.  Thus,  the  following  subsumes  12.2(a)(b)  and  more: 

14.4  Exercise  (Solovay  - cf.  Fenstad-Normann  [74]).  Eveiy  provably  A,  set  of 
reals  is  Lebesgue  measurable  and  has  the  Baire  property. 

Hint.  Suppose  that  A is  provably  A\(a)  for  some  a e wco,  and  let  r be  a theorem  of 
ZFC  establishing  the  equivalence  between  a formulation  and  a 77]  formulation 
of  A.  If  a is  the  theorem  of  ZF  of  13.15,  then  any  transitive  e-model  M of  a A r 
with  a e M and  containing  the  recursive  relations  is  absolute  for  A.  (The  point 
is  that  co i c M as  in  13.15  is  no  longer  required,  as  the  Ll]  formulation  can  be 
used  contrapositively  in  place  of  the  argument  with  the  tree  over  to  x co\ .)  Now 
work  with  a countable  such  M satisfying  finitely  more  axioms  of  ZFC  so  that  the 
arguments  for  (b)  —>  (a)  of  14.1  and  14.2  can  be  used  with  M in  place  of  L[a].-\ 

Finally,  recent  characterizations  are  provided  that  further  attest  to  the  intrinsic 
importance  of  random  and  Cohen  reals  in  descriptive  set  theory:  They  complement 
14.1  and  14.2,  inform  on  Godel’s  13.10,  and  could  well  have  been  noticed  much 
earlier.  First,  a preparatory  exercise: 

14.5  Exercise.  Suppose  that  M is  an  inner  model  of  ZFC.  Then: 

(a)  If  there  is  a real  random  over  M,  then  such  reals  are  dense:  For  any 
closed  code  c e M for  a non-null  set  there  is  a real  in  Ac  random  over  M. 

(b)  If  there  is  a Cohen  real  over  M,  then  such  reals  are  dense:  For  any 
s € <u>co  there  is  a Cohen  real  3 s over  M. 

Hint,  (b)  is  much  easier  than  (a),  and  both  can  be  deduced  from  the  fact  that  the 
notions  of  forcing  are  weakly  homogeneous  (cf.  10.19).  However,  this  fact  for 
random  real  forcing  depends  on  involved  results  about  Lebesgue  measure,  so  a 
direct  approach  is  outlined  for  (a): 

Let  x be  random  over  M.  The  idea  is  to  develop  a permutation  p\  co  —>  co  so 
that  the  induced  homeomorphism  ~p:  wco  ->  “co  (defined  by  ~p(f)(n)  = f(p(n )) 
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for  n e co)  satisfies  p(x ) e Ac.  Such  induced  homeomorphisms  preserve  measure, 
so  p(x)  will  also  be  random  over  M.  To  begin  with,  it  can  be  assumed  that 

Ac  = a,co-{f[0{sij)  | j eco] 

where  the  0(s;.)’s  are  pairwise  disjoint,  p will  be  comprised  of  transpositions 
p(am ) = bm  and  p(b,„)  — am  for  am’s  and  bm’s  all  distinct  and  exhausting  co, 
defined  in  pairs  (am,  bm)  by  recursion  as  follows: 

Suppose  that  the  am’s  and  bm’s  for  m < n have  been  defined,  recursively 
satisfying 

mL(.\J{°(sij ) I sh  2 {{a0,x(b0)),...,  {a„-Ux(b„- 1)>}})  < Y\m<n2~(x(bm)+1)  • 

(The  basis  n = 0 is  included  vacuously  here  with  the  value  on  the  right  being  1, 
a consequence  of  the  assumption  that  Ac  is  not  null.)  Let  an  be  the  least  member 
of  co  different  from  all  the  <7,„’s  and  bm' s for  m < n.  (So  ao  — 0.)  There  must  be 
a k e co  such  that 

'nL(U{°(siy)  I s0  ^ {{ao,  x(b0)), ...,  (a„_i,  x(b„-\)),  {an,k)}}) 

j-(k+ 1)  _ r-r  r,-(x(bm)+l) 

^ ^ 1 I m<n^ 

Since  x is  random,  choose  b„  ^ a„  and  different  from  all  the  fl„,’s  and  bm’ s for 
m < n such  that  x(bn)  — k.  Thus,  the  recursion  assumption  is  perpetuated.  It  can 
be  checked  that  this  construction  insures  that  ~p(x)  e Ac.  H 

14.6  Theorem  (Ihoda-Shelah  [89]).  Suppose  that  a e "‘co.  Then: 

(a)  Every  A\(a)  set  of  reals  is  Lebesgue  measurable  iff  there  is  a real  random 
over  L[fl]. 

(b)  Every  A\ (a)  set  of  reals  has  the  Baire  property  iff  there  is  a Cohen  real 
over  L\a\ 

Proof.  As  usual,  only  the  measure  result  is  established.  The  forward  direction 
follows  the  proof  of  (a)  ->  (b)  of  14.1,  noting  that  the  R in  that  proof  would 
actually  be  A\(a)  if  there  were  no  reals  random  over  L[a]  (cf.  13.10). 

The  converse  is  a variation  of  the  argument  for  14.4.  Suppose  that  there  is  a 
real  random  over  L[a]  and  that  A C k(‘°co)  is  A\(a).  Taking  k — 1 for  simplicity, 
there  are  consequently  Ejia)  relations  Rq,  Ri  c °> co  such  that  for  any  x e wco, 

A(x)  Rq(x)  —'Ri(x)  . 

As  usual,  if  M is  a transitive  class  containing  all  recursive  relations,  then  Rf 
denotes  the  relativization  of  (the  normal  form  of)  R,  to  M.  Let  P — (B*)L^a\  the 
random  real  forcing  in  the  sense  of  L[a\,  and  r a P-name  for  the  random  real. 
Then 

D — {p  e P \ p \[  (r ) for  exactly  one  i < 2}  e L[a~\ 

is  dense  in  P by  the  following  argument  in  V : Starting  with  an  arbitrary  member 
of  P,  which  for  demonstrating  density  can  be  taken  to  be  of  form  Ak^  for  some 
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closed  code  c e L[a],  there  is  an  x e Ac  random  over  L[ci\  by  14.5(a).  There  is 
exactly  one  i < 2 such  that  R/(x),  whereupon  T,\  Absoluteness  13.15  implies  that 
r'  ^x\x) . Hence,  there  is  a p < A^fll  such  that  p e D. 

Since  D is  dense  open  in  P,  there  is  in  L\a~\  a maximal  antichain  E C D 
consisting  of  closed  sets,  which  by  the  &>i-c.c.  is  countable.  For  i < 2,  set 

Yj  = [c  e 01  co  | c e L[ci\  is  a closed  code  A 

Apl  e e A A^  |b  flf[alw(r)}  , and 
Z,  = U Me  I ceYi } . 

It  now  suffices  to  verify  that  Z,  — R,  is  null  for  i < 2.  Granted  this  and  invoking 
the  absoluteness  11.8  of  codes,  that 

(i)  Z0  fl  Z\  is  null  since  Y0  fl  Y\  = 0 , and 

(ii)  nii(Zo)  + mi(Z\ ) = 1 by  the  maximality  of  E 

will  together  imply  that  Z0  A R0  is  null.  Hence,  it  can  be  concluded  that  R0  = A 
is  measurable. 

To  verify  that  Z,  — R,  is  null  for  i < 2,  let  M e I[a]  be  a countable  transitive 
e-model  of  enough  of  ZFC  with  To-  Y\  e M.  (It  will  suffice  to  take  M < L,_[a] 
for  some  sufficiently  large  regular  X,  and  such  M model  ZFC  except  possibly  for 
the  Power  Set  Axiom.  In  what  follows,  pretend  that  M |=  ZFC  for  the  use  of 
established  terminology.)  Since  M is  countable,  the  set  of  reals  not  random  over 
M is  null  (cf.  11.10).  If  on  the  other  hand  x e “ co  is  random  over  M and  x € Z,-, 
then  x e Ac  for  some  c e T,  and  so  x e Hence,  R^'\x)  by  genericity, 

and  so  by  the  upward  direction  of  Lo  Absoluteness  13.15  (which  does  not  require 
co i C M ) it  follows  that  Rj(x).  This  confirms  what  was  to  be  verified,  and  so  the 
proof  is  complete.  H 

As  observed  by  Solovay,  if  Ki  many  random  reals  are  generically  added  to 
L[a],  then  the  Zb1  (a)  set  10 co  fl  L[a\  is  not  Lebesgue  measurable  in  the  extension 
(see  Jech  [03:  535ff]).  Thus,  the  measurability  of  every  A\(a)  set  does  not  imply 
the  measurability  of  every  Zjfiz)  set.  Analogous  remarks  hold  for  the  Baire 
property.  As  to  the  difference  in  the  proofs  of  14.4  and  14.6,  the  countable  M of 
the  former  is  absolute  for  A by  provability,  while  it  was  necessary  to  incorporate  a 
maximal  antichain  into  the  M of  the  latter,  which  in  turn  depended  on  the  property 
that  Rq{x)  **  -■f?i(v)  holds  for  all  .v  random  over  L[a], 

Solovay  also  characterized  the  perfect  set  property  for  Y}2  sets  by  a forcing 
argument  as  in  11.11.  Then  an  elegant  result  of  Richard  Mansfield  revealed  the 
structural  essence.  It  too  was  proved  initially  by  forcing,  but  Solovay  provided 
a proof  that  can  be  seen  as  an  elaboration  of  the  Cantor-Bendixson  argument 
(11.3(a)): 

14.7  Theorem  (Mansfield  [70]).  Suppose  that  for  some  infinite  Y,  T is  a tree  on 
co  x Y such  that  T e L\T\,  A C coco,  and  A = p\T],  i.e.  for  any  x e ®o >, 
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A(x)  3/  e wF((x,  f)  e [T])  . 

Then  either  A C L[T\  and  |A|  < 1 7" | , or  else  A has  a perfect  subset. 

Proof.  For  any  tree  U on  co  x Y,  let  U'  consist  of  those  members  of  U having 
extensions  in  U with  incompatible  first  coordinates,  i.e. 

U'  = {{s,  h)  e U | 3(^0,  h0),  (su  hi)  e U«s0,  h0)  2*  (s,  h) 

A (si,hi)  2*  {s,h)  A |s0|  = \si\  A Sof^Si)}. 

Now  set  To  — T,  Ta+ 1 = T'a,  and  for  limit  ordinals  8 > 0,  Tg  = n«<5  It  is 
clear  that  each  Ta  e L[T\.  There  is  a fi  < |F|+  such  that  Tp+\  — Tp\  set  T*  = Tp 
for  the  least  such  ft. 

If  T*  = 0,  then  A c L[T]  and  |A|  < |r|:  Suppose  that  x e wa>  satisfies 
A(x),  so  that  there  is  some  / e mY  with  (x,  f)  e [T],  Clearly  there  is  an 
a < p such  that  (. x , f)  e [rff]  — [T„+i].  But  then,  there  is  an  in  e a>  such  that 
(x\m,  f\m)  e Ta  — Ta+\,  so  that  any  (s,  h)  e Ta  with  5 2 x\m  and  h 2 / \m 
satisfies  sCi.  Hence, 

x — 1J{j  | 3 h(s  2 x\m  A /;  2 f\m  A [s,  h)  e Tq,)}  , 

and  so  x e L[T ].  Thus,  A c L\T ].  This  analysis  shows  that  any  x e wco 
satisfying  A(x)  is  determined  by  an  a < ft  and  a pair  (. x\m , f\m)  e Ta.  Hence 
also  |A|  < |^|-|r|  = 

The  proof  is  completed  by  showing  that  if  T*  / 0,  then  A has  a perfect 
subset:  Starting  with  a member  of  T*  and  using  T*'  — T*,  recursively  define 
st  € <0>u>  for  each  t e <<u2  such  that  for  any  t,  u e <w2, 

(i)  3 h((s„h)  e T*)  , 

(ii)  if  t C m,  then  st  C su  , and 

(iii)  if  |r|  = \u\  yet  t / u,  then  st(i)  f su(i ) 
for  some  i e domfy,)  fl  dom(y„)  . 

Then  S — {s  e <mu>  \ 3 1 e <a>2{s  c s, ) | is  a tree  on  a>,  [5]  c A,  and  building  on 
12.10,  [S]  is  perfect.  H 

The  following  corollaries  are  immediate  from  the  tree  representations  of  L] 
sets  (13.13(c))  and  sets  (13.14). 

14.8  Corollary  (Suslin  - Luzin  [17]).  Every  L]  set  of  reals  has  the  perfect  set 

property.  H 

14.9  Corollary  (Solovay  [69]).  If  a e 01  co  and  A is  a S\{a)  set  of  reals,  then 

either  A C L\a)  or  A has  a perfect  subset.  H 

Recalling  the  set  C\  defined  after  13.12,  14.9  implies  that  if  < w\,  then 
the  countable  set 
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q = “fflnL[a] 

is  a T,\  (a)  set  such  that  any  Elia)  set  without  a perfect  subset  is  a subset  of  Cf. 
Thus: 

14.10  Theorem  (Solovay  [69]).  The  following  are  equivalent  for  any  a e mco: 

(a)  Every  E\  (a)  set  of  reals  has  the  perfect  set  property. 

(b)  Every  Tl\  (a)  set  of  reals  has  the  perfect  set  property. 

(c)  < 00\. 

Proof,  (a)  ->  (b)  is  immediate,  (b)  -*  (c)  is  13.12  relativized,  and  (c)  ->  (a) 
follows  from  the  above  remarks.  H 

This  result  refines  11.6  and  shows  that  the  perfect  set  property  for  all  nj 
sets  requires  the  consistency  strength  of  an  inaccessible  cardinal.  As  observed 
by  Lyubeckij  [70],  14.3  and  14.10  have  the  curious  consequence  that  if  every 
Li,  set  has  the  perfect  set  property,  then  every  £[  set  is  Lebesgue  measurable  - 
with  the  implication  between  these  classical  properties  proceeding  through  Vo  e 
< &>i).  Lyubeckij  [71]  rederived  several  of  Solovay’s  results  about  £[ 

sets. 

Definability  of  Sharps 

The  main  motivation  for  Solovay’s  study  of  0#  was  to  analyze  its  definability. 
By  13.16  every  subset  of  co  is  constructible.  Using  methods  of  Gaifman  and 
Rowbottom,  Solovay  showed  in  the  autumn  of  1965  that  if  there  is  a measurable 
cardinal,  then  there  is  a non-constructible  A\  subset  of  co.  Then  in  the  context 
of  Silver’s  work  on  indiscemibles,  Solovay  and  Silver  independently  saw  that  0# 
has  stronger  properties.  For  present  purposes,  temporarily  specify  that  if  cp  is  a 
formula  in  the  language  £*  of  §9,  then  r</d  e co  is  its  Gddel  number  in  some 
recursive  arithmetization  of  £*.  Recall  that  by  this  means,  0#  is  regarded  as  a 
subset  of  oo. 

14.11  Theorem  (ZF)(Solovay  [67],  Silver  [71]).  The  relation  R C defined  by 

R(x)  -co-  0#  exists  A x eM2  A {/«  | x(m)  — 1}  = 0# 

is  nf 

Proof.  We  analyze  the  definition  from  §9  that  0#  is  the  unique  EM  blueprint 
satisfying  conditions  (I)-(III);  this  uniqueness  is  a consequence  of  ZF,  since  as 
observed  in  §9  the  0#  theory  does  not  require  AC.  For  any  x e “&>,  let  Thv  = 
{<p  | xf<f)  — 1}.  Now  define  S c.  wco  by:  S(x ) iff  x e ®2;  for  any  in  e co, 
x(tn)  = 1 implies  that  m = ^ f for  some  sentence  cp  of  and  Thv  is  a complete 
and  consistent  theory  of  £*  containing 

(i)  the  sentence  cto  of  3.3(a) , 
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(ii)  the  sentences  asserting  that  the  c,  ’s  are  indiscemibles  , and 

(iii)  the  sentences  for  satisfying  conditions  (II)  and  (III)  . 

Then  it  is  straightforward  to  check  that  S is  arithmetical;  to  assert  that  Thv  is 
consistent,  it  is  enough  to  say  that  there  is  no  integer  coding  a proof  from  it  of 
i>0  / vq-  Note  that  if  S(x),  then  by  the  Completeness  Theorem  Thv  has  a countable 
model.  But  technically  speaking,  (i)  and  (ii)  do  not  suffice  to  show  that  Thv  is  an 
EM  blueprint,  for  only  if  the  model  is  well-founded  can  its  transitive  collapse  be 
taken  and  3.3(a)  invoked.  However,  this  will  be  taken  care  of  when  condition  (I) 
is  incorporated. 

For  this  purpose,  recall  the  terminology  of  13.8;  in  particular,  every  countable 
well-ordering  is  an  Ey  for  some  y e "ft).  The  R c wco  of  the  theorem  is  then  seen 
to  be  equivalent  to: 

S(x ) A V1yV1z[(£'v  is  a well-ordering  A Mz  satisfies  9.4  with  Th( 

the  theory  and  indiscemibles  of  ordertype  Ey ) ->  Mz  is  well-founded]  . 

That  Mz  satisfies  9.4  etc.  is  : Beyond  what  Mz  is  to  model,  one  must  assert 
that  there  is  an  order-preserving  injection  of  Ey  into  OnM;  such  that  its  image  is  a 
set  of  indiscemibles  for  Mz,  and  every  element  of  Mz  results  from  a Skolem  term 
applied  to  these  indiscemibles.  Thus,  R has  fomi  V'yV'zf/T]  A — >■  77]]  and 

so  is  77],  H 

14.12  Corollary. 

(a)  0#  is  absolute  for  transitive  e- models  M of  ZF  such  that  a>\  c M in  the 
following  sense: 

M \=  'There  is  an  EM  blueprint  satisfying  (I)-(III)1  iff  0#  e M , 

in  which  case  M \=  r0#  is  the  unique  EM  blueprint  satisfying  (I)-(III)1 . 

(b)  0#  is  a non-constructible  A\  subset  of  a>. 

(c)  Assume  0#  exists.  Then  eveiy  x e V(co)  (T  L is  one-one  reducible  to  0#, 
i.e.  there  is  an  injective,  total  recursive  function  f'.ca-^-co  such  that  x = /~*  ( 0#), 
and  so  in  particular,  x is  A\. 

Proof,  (a)  This  follows  from  2,  Absoluteness  13.15.  With  R as  in  14.11,  if 
M 1=  There  is  an  EM  blueprint  satisfying  (I)-(III),  then  there  is  an  x e M such 
that  ( R(x))m . Hence  R(x),  and  so  {m  \ x(m)  = 1}  = 0#  e M. 

(b)  By  (a),  0 # <£  L.  Also,  for  any  m e a>, 

m eO*  o 3x{R(x)  A x(m)  = 1) 

Wx(R(x)  — > x{m ) = 1)  . 

The  first  expression  shows  that  0#  is  X’31,  and  the  second,  that  it  is  77]. 

(c)  x — tL{t flt . . . , tfn)  for  some  term  t and  indiscemibles  (fl ; , t|n.  As  a 

recursive  arithmetization  had  been  assumed,  the  function  / e "&>  defined  by 

f(m ) = ^ m e t(c\, ....  c„y 
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is  an  injective  total  recursive  function.  But  clearly,  x — f 1 (0#).  The  last 
assertion  of  (c)  follows,  as  0#  is  A\.  ~\ 

(c)  implies  by  remarks  after  9.12  that  if  0#  exists,  then  the  set  of  (Godel 
numbers  of)  sentences  true  in  L is  one-one  reducible  to  0#. 

These  properties  of  0#  relativize  directly  for  any  a c a>  to  a#  (as  defined  at 
the  end  of  §9)  over  L[a],  Consequently,  the  assumption  that  a#  exists  for  every 
a c co  provides  a rich  landscape  for  the  study  of  relative  constructibility,  akin  to 
the  study  of  Turing  reducibility  in  recursion  theory.  For  any  sets  a and  b, 

a <c  b iff  a e L\b]  , 

and  let  <c  and  =c  have  the  derived  meanings.  Restricting  the  discussion  to 
V( co),  the  equivalence  classes  under  =c  are  called  the  constructibility  degrees  of 
reals,  and  the  orderings  <c  and  <c  carry  over  to  these  degrees.  V(co)  fl  L is  the 
minimum  degree  and  the  degrees  form  an  upper  semi-lattice  under  <c.  Because  of 
the  transcendence  properties  of  a <c  a#,  # can  be  regarded  as  the  analogue  of  the 
jump  operation  on  Turing  degrees.  Jeffrey  Paris  [74]  established  a significant  result 
on  patterns  of  indiscemibles  which  implies  that  if  a <c  b <c  a*,  then  a#  =c  b#\ 
the  analogous  assertion  about  Turing  degrees  with  jump  is  well-known  to  be  false. 
Paris’s  result  was  a basic  ingredient  for  further  results:  Leo  Harrington  and  Kechris 
in  their  [77]  established  partial  results  toward  a conjecture  of  Solovay  (see  below). 
Also,  Farrington  [82]  showed  that  the  first-order  theory  of  the  upper  semi-lattice 
of  constructibility  degrees  with  # has  the  same  sort  of  recursive  isomorphism  to 
the  truth  set  of  second-order  arithmetic  as  was  established  by  Simpson  [77]  for 
the  Turing  degrees.  In  evident  analogy  with  the  Turing  degrees,  much  more  can 
still  be  done  in  this  investigation  of  #. 

The  remarkable  properties  of  0#  prompted  Solovay  to  make  sweeping  con- 
jectures in  the  late  1960’s  that  raised  broad  issues  about  the  scope  of  the  forcing 
method.  The  first  was  the  Genericity  Conjecture'. 

If  a C On  and  0#  j L\a\,  then  a is  generic  over  L . 

This  conjecture  turns  on  what  is  meant  by  genericity  over  L,  and  a e L are 
allowed  to  be  generic  over  L in  a trivial  sense.  If  M is  an  inner  model,  a set 
a c On  is  set-generic  over  M iff  there  is  a p.o.  P e M and  a G P-generic 
over  M such  that  M[a]  — M[G].  Since  forcing  with  any  (set)  p.o.  P preserves 
cardinals  > |P|+,  0#  is  not  set-generic  over  L by  9.17(c)  and  14.12(a).  Confident 
in  the  apparent  generality  of  forcing,  Solovay  originally  meant  his  conjecture  to 
be  an  assertion  of  minimal  transcendence  for  0#  from  the  set  forcing  point  of 
view.  However,  in  1975-6  Jensen  refuted  this  version  of  the  conjecture  using  his 
impressive  “coding  the  universe  in  a real”  technique,  with  which  he  established 
the  following  result. 

14.13  Theorem  (Jensen  - Beller-Jensen- Welch  [82]).  Assuming  GCH,  there  is  a 
proper  class  notion  of  forcing  P such  that  if  G is  P -generic,  then  V and  V[G] 
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Forcing  with  P moreover  preserves  various  large  cardinal  properties  con- 
sistent with  V — L like  weak  compactness.  In  the  mid-1980’s  Sy  Friedman 
[94]  refined  and  simplified  Jensen’s  proof  toward  the  development  of  the  much 
stronger  “strong  coding”  technique  of  S.  Friedman  [87,87a].  In  1989  Shelah  and 
Lee  Stanley  (see  their  [90,95,95a])  found  a short  proof  of  Jensen’s  original  cod- 
ing result  assuming  that  0#  does  not  exist,  and  subsequently  so  did  S.  Friedman 
[97], 

Jensen  also  showed  that  it  is  consistent  relative  to  ZFC  that  there  is  a real  a 
as  in  14.13  so  that  0#  ^ L[a\  yet  a is  not  set-generic  over  L,  and  that  such  an  a 
can  be  constructed  outright  from  0#: 

14.14  Theorem  (Jensen  - Beller-Jensen- Welch  [82]).  Assume  that  0#  exists.  Then 
there  is  an  a C a>  such  that  a <c  0#  yet  a is  not  set-generic  over  L.  H 

The  more  liberal  interpretations  of  genericity  over  L involve  proper  class 
notions  of  forcing;  see  Beller-Jensen- Welch  [82:  155]  for  several  formulations. 
Suppose  that  M is  an  inner  model.  For  P c M,  { M , P)  is  amenable  iff  PC\x  e M 
for  every  x e M.  This  is  a weaker  condition  than  requiring  that  P be  a (definable) 
class  in  the  sense  of  M.  a C On  is  amenable-generic  over  M iff  there  is  a class 
p.o.  P such  that  (M,  P)  is  amenable  and  the  forcing  relation  |[p  restricted  to 
formulas  of  any  bounded  complexity  is  definable  in  ( M , P),  and  there  is  a G P- 
generic  over  M (i.e.  G meets  every  dense  subclass  of  P definable  in  M in  addition 
to  the  usual  genericity  requirements)  such  that  a e M[G ].  0#  is  not  amenable- 
generic  over  L.  S.  Friedman  developed  a notion  of  forcing  with  hyperclasses 
(classes  of  classes)  to  show  that  if  0#  exists,  then  there  is  an  a c co  such  that 
a <c  0#  yet  a is  not  amenable-generic  over  L.  Mack  Stanley  [94]  established 
the  consistency  of  the  existence  of  a real  not  amenable-generic  over  any  inner 
model  to  which  it  does  not  belong.  (However,  the  existence  of  Stanley’s  real  is 
incompatible  with  that  of  0#.) 

A second  conjecture  of  Solovay’s  had  to  do  more  directly  with  the  definability 
of  0#.  For  x e wco,  or  x c co  regarded  as  a member  of  w2  through  its  characteristic 
function, 

x is  a Tl\  singleton  iff  [x]  is  a /7(J  subset  of  u>co  . 

By  Ej  Absoluteness  13.15,  a n]  singleton  is  definable  by  a formula  that  defines  it 
in  every  inner  model  that  contains  it,  and  by  14.11,  0#  is  such  a singleton.  Because 
this  absolute  definability  seemed  a structurally  stringent  constraint,  Solovay  made 
the  n\  Singleton  Conjecture: 

There  are  no  Tl\  singletons  a such  that  a L yet  a <c  0#. 
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That  is,  0#  is  a minimal,  non-constructible  77,1  singleton  with  respect  to  relative 
constructibility.  The  first  related  work  was  the  discovery  of  several  forcing  notions 
for  adjoining  a 77^  singleton  to  a model  of  ZFC  without  large  cardinal  hypothe- 
ses: Jensen-Solovay  [70],  Jensen  [70],  and  Jensen-Johnsbraten  [74].  These  are 
particularly  elegant  constructions,  but  they  do  not  yield  77 \ singletons  if  0#  ex- 
ists. The  Jensen-Johnsbraten  real  was  the  most  absolute:  if  a>\  = u>\,  then  it 
is  a 77^  singleton.  Building  on  Jensen’s  coding  techniques,  several  people  then 
established  results  more  directly  toward  the  conjecture.  For  instance,  Rene  David 
[82]  showed  that  there  is  a non-constructible  real  a satisfying  a <c  0#  such  that 
for  any  set-generic  b c On  over  L[a],  a is  a n\  singleton  in  L[fl][b]  (through 
the  same  n\  formula  for  all  such  b).  See  David  [84,  89]  for  various  properties  of 
0#  simulated  by  reals  in  generic  extensions.  Then  in  culminating  work,  Friedman 
finally  refuted  the  conjecture  outright: 

14.15  Theorem  (S.  Friedman  [90]).  Assume  that  0#  exists.  Then  there  is  a 77 J 
singleton  a such  that  a ^ L yet  a <c  0.  H 

Solovay  apparently  conjectured  the  negation  of  his  original  conjecture  soon 
after  learning  of  Jensen’s  14.14,  and  this  is  the  orientation  of  S.  Friedman  [90], 

Solovay  had  made  a third  conjecture  asserting  in  part  that  all  constructibility 
degrees  containing  77,  singletons  are  produced  by  the  iterative  application  of  #. 
Before  this  was  refuted  by  Friedman’s  work,  Harrington-Kechris  [77],  building 
on  Paris  [74],  had  made  observations  that  tended  to  support  the  conjecture.  They 
showed  that  if  x c co  is  a 77,  singleton,  then  either  0#  <c  x or  x#  =c  0#.  Rela- 
tivizing  this,  they  observed  that  the  constructibility  degrees  of  0#,  0##,  . . . 

are  the  first  co  degrees  of  sharps  of  77,1  singletons. 

Uniform  Indiscernibles 

Solovay’ s conjectures  generated  results  that  have  led  to  a considerable  under- 
standing of  sharps,  forcing,  and  definability  at  the  second  level  of  the  projective 
hierarchy.  Another  important  chapter  was  begun  in  the  late  1960’s  when,  as  dis- 
cussed in  the  next  section,  sharps  were  used  to  analyze  the  sets  in  the  third  level 
of  the  projective  hierarchy.  In  preparation,  some  results  are  established  here  about 
the  interplay  of  the  sharps,  building  on  the  §9  theory  in  relativized  form. 

Up  to  the  discussion  above  of  77(J  singletons  it  was  convenient  to  regard  reals 
for  the  theory  of  sharps  as  subsets  of  co,  since  the  classes  L [A]  had  only  been 
defined  for  A c On;  henceforth,  it  is  incumbent  to  consider  reals  as  members  of 
wco.  For  a e aco,  set 

a#  — {{n,  a(n)}  \ n e co}#  , 

i.e.  the  sharp  of  the  graph  of  a coded  as  a subset  of  co,  and  stipulate  that 

L[a]  = L[{{n,  a(n))  \ n e <w}]  , and 

7„  is  the  corresponding  class  of  indiscernibles  for  L[a]  . 
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Moreover,  for  either  a c co  or  a e 01  co,  construe  a#  c co  also  as 
a#  e a>2  vja  characteristic  function 

as  befits  the  context. 

The  uniformity  of  the  relativization  of  14.11  is  first  recorded: 

14.16  Exercise  (ZF).  The  relation  S C 2(",a>)  defined  by 

S(a,x)  **  a#  exists  A x = a# 

is  n\.  H 

For  a e wco,  the  relativization  of  14.12(a)  leading  to  the  absoluteness  of  a# 
is  direct.  Taken  for  granted  in  what  follows  is  the  straightforward  consequence 
that  the  definition  from  a#  of  the  class  Ia  of  indiscemibles  for  (L[fl],  e,fl),  in 
terms  of  Skolem  hulls  and  so  forth  as  in  §9,  is  absolute  for  any  inner  model  M 
containing  a*.  Properties  of  Ia  that  follow  from  the  relativization  of  9.14  will 
also  be  assumed,  e.g.  Ia  is  a closed  unbounded  class  containing  every  uncountable 
cardinal,  and  if  <5  e I„,  then  (L^[u],  e,  a)  -<  { L\a\ , e,  a). 

When  assuming  da  e 0}co{a#  exists),  set 

I*  = fl. *,Ja  , 

the  class  of  uniform  indiscemibles. 

14.17  Exercise  (ZF). 

(a)  Suppose  that  a,  b e wco  with  a e L[b],  and  b # exists.  Then  a#  exists  and 

h c Ia. 

(b)  Assume  da  e “<u(a#  exists).  Then  I*  is  a closed  unbounded  class  of 
ordinals  containing  every  uncountable  cardinal,  and  for  any  a e coa>  and  u e I*, 
I a (T  u has  ordertype  u. 

Hint,  (a)  For  the  existence  of  a#,  note  that  for  any  X > co,  (f  fl  X)  — i]  is  a 
set  of  indiscemibles  for  Lx\a\  e L[b]  for  some  r]  < X (thrown  in  to  get  above 
indiscemibles  figuring  in  some  definition  of  a). 

Assuming  next  that  y $ Ia,  we  show  that  y £ Ip.  First, 

Y = t{L[a]'€'a)  {x  i, . . . , xm,yu  ...,  y„) 

for  some  Skolem  term  t (of  Ce(a))  and  x\  <...<  xm  < y\  < ...  < y„  all  in  Ia 
with  xm  < y < y\.  By  the  remarkable  condition  (III)  of  §9  relativized,  it  can  be 
assumed  that  the  v;’s  are  in  the  closed  unbounded  class  Ia  fl  f - Also,  by  further 
applications  of  (III)  each  xt  is  definable  in  L[b]  in  terms  of  members  of  f — {y} 
and  a is  definable  in  L[b]  in  terms  of  members  of  4 — (y  + 1).  Consequently,  y 
is  definable  in  L[b]  in  terms  of  members  of  /*  — {y},  and  hence  y £ h- 

(b)  For  the  last  assertion,  suppose  that  a e m(o,  u e /*,  and  f < u.  By 
the  remarkable  condition  (III)  and  the  simple  observation  that  u must  be  a limit 
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member  of  f,  for  every  b e 01  u>,  f is  definable  in  (L[a#],e,a#)  in  terms  of 
members  of  Ia»  fl  u.  By  absoluteness  of  a*  it  follows  that  the  fth  member  of  Ia 
is  definable  in  L[a#]  in  terms  of  members  of  Ia*  fl  u,  and  so  that  member  is  less 
than  u (by  condition  (II)  of  §9).  But  this  holds  for  any  f < u,  so  it  follows  that 
the  ordertype  of  Ia  fl  u is  u.  ~\ 

Assuming  Va  e (coexists),  set 

/*  = {us,  | 0 < f e On} 
with  the  indexing  in  increasing  order;  then 

u i = a i and  Vf  > 0 (u$  < a>$)  , 

the  first  as  every  countable  ordinal  is  coded  by  a real,  and  the  second  as  I*  contains 
every  uncountable  cardinal. 

14.18  Proposition  (ZF)(Solovay).  Assume  Vu  e wa>(a#  exists)  and  f > 0.  Then: 

(a)  cf(w£+i)  = cf(«2)  < oj2. 

(b)  For  any  y < u%,  there  is  an  a e 01  a>,  a Skolem  term  t,  and 

0 < fi  < . . . < < f such  that 

y = t{Llal€’a)(u^,...,u^. 

Proof.  Define  n : On  x wco  ->  On  by 

n(a,  a)  — least  member  of  Ia  — (a  + 1)  . 

Then  for  any  f > 0, 

(*)  sup({«(M|,  a)  | a e “"co})  = M|+i  . 

To  show  this,  note  that  we  have  <,  and  so  it  suffices  to  show  that  the  supremum 
is  in  I*.  But  for  any  b e U,u>, 

sup({n(w£,a)  | a € "co})  = sup  ({n(u$,a)  \ a e “at  A be  i[fl]}) 

by  14.17(a)  since  for  any  a e wco  there  is  an  d e 0>co  such  that  { a , b}  c L[a],  and 
the  latter  supremum  is  in  //,  as  it  is  closed. 

To  establish  (a),  note  that 

(**)  If  n(u\,  a)  = n(u\,  b),  then  n(u^ , a)  — n(u§,  b)  . 

To  see  this,  let  c e wco  code  a and  b,  e.g.  (c)o  = a and  (c)i  = b.  Then  it 
is  straightforward  to  see  by  14.17(a)  and  absoluteness  of  sharps  that  there  is  a 
formula  f(vo)  such  that  for  any  x e lc», 

n(x,  a)  = n(x,  b)  iff  ( L[c# ],  e,  c#)  [=  fix]  . 

Hence,  (**)  follows  by  indiscemibility. 
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By  (*),  { n(u\,a ) | a e “&>}  is  cofinal  in  u2,  and  by  (**)  the  map  sending 
n(u\,a)  to  n(u%,a)  is  well-defined.  Hence  (a)  follows. 

To  establish  (b),  proceed  by  induction  on  £ and  then  on  y . So,  it  can  be 
assumed  that  £ = f + 1 for  some  f > 0 and  < y < u^+i-  Since  y £ /*,  there 
is  an  a e mco  such  that 

y = t0  (xi,  ...,xm,y  i, 

for  some  Skolem  term  to  and  x\  < . . . < xm  < y i < . . . < yn  all  in  Ia  with 
xm  < y < y i.  The  x, ’ s can  be  replaced  by  members  of  I*:  Inductively,  each  x,  is 
definable  in  (L[a;],  e,  a,}  for  some  a,  e w u>  in  terms  of  members  of  I*  less  than 
U(+ 1,  and  hence  so  definable  in  e,fl;}  -<  (L[a,],  e,a,}  for  some  (any)  S 

in  I*  — Mf+i.  So,  if  necessary,  a can  be  replaced  by  a real  coding  a,  a\, . . . , an 
and  each  x,  by  its  definition,  subsuming  the  8 into  the  y,  ’s  by  14.17(a).  Thus, 

(*  * *)  Y = t[L[al€M)  (ufl yi , y„) 

for  some  new  Skolem  term  t\ , with  |i  < . . . < 

Next,  by  our  initial  observation  (*),  y < u^+ 1 implies  that  there  is  a e "cu 
such  that  y < n(u^,b).  As  before,  replacing  a by  a real  coding  a and  b if 
necessary,  it  can  be  assumed  by  14.17(a)  that  b = a. 

Finally,  by  the  absoluteness  of  sharps,  (***)  also  holds  in  the  sense  of  L[a#~\. 
Arguing  there  with  the  remarkable  condition  (III)  applied  to  y < n(ut,  a)  and  the 
fact  that  I a contains  every  uncountable  cardinal,  the  v,  ’s  can  be  replaced  by  the 
first  n cardinals  in  the  sense  of  L[av\  greater  than  . But  these  cardinals  are 
definable  in  L[a#]  from  u.%,  and  hence 

y — t\  \uHi.  •••>«{,,  Mf) 

for  some  Skolem  term  t2  and  |i  < . . . < f to  complete  the  proof.  H 

That  14.18  is  provable  in  ZF  was  noted  for  a later  purpose.  With  the  Axiom 
of  Choice  co2  is  of  course  regular,  and  that  a>2  e I*  has  the  following  consequence 
in  ZFC: 

14.19  Corollary.  un  < cot,  for  every  n < co.  H 

These  results  will  be  applied  in  the  forthcoming  section. 
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15.  Sharps  and  L3  Sets 

Martin  and  Solovay  in  their  [69]  established  results  about  Ll  sets  under  the  ex- 
istence of  sharps  that  began  the  use  of  large  cardinal  hypotheses  to  extend  the 
methods  of  classical  descriptive  set  theory.  Martin  went  on  to  provide  a sharp  tree 
representation  of  sets,  and  these  first  explorations  broke  the  ground  for  major 
developments  over  a decade  later:  the  analysis  of  the  projective  sets  under  the 
Axiom  of  Determinacy  (§§29, 30),  and  the  consistency  of  that  hypothesis  relative 
to  large  cardinals  (§32).  Against  this  larger  backdrop  the  results  for  £3  have  a 
transitional  flavor,  but  they  were  to  form  the  bases  of  the  grander  edifices  to  follow 
and,  being  close  enough  to  the  bedrock  of  £},  to  figure  in  important  absoluteness 
arguments. 

Unlike  previous  sections  this  one  is  largely  devoted  to  a single  technical 
development  - one  that  serves  to  motivate  later  generalizations  and  substantiate 
how  large  cardinals  provide  contextually  optimal  analyses  of  the  projective  sets. 
A tree  representation  for  £3  sets  implicit  in  Martin- Solovay  [69]  was  brought  to 
light  by  Mansfield  [71]  using  a measurable  cardinal;  this  approach  is  first  taken 
because  of  its  own  significance  and  simpler  context,  and  then  the  context  refined 
to  follow  from  the  existence  of  sharps,  as  was  done  by  Martin.  Kechris  [81]  is  a 
good  reference. 

The  idea  is  to  develop  a tree  representation  for  £3  sets  based  on  one  for  L] 
sets,  emulating  how  a tree  representation  for  £(  sets  was  developed  based  on  one 
for  Lj  sets.  Recapitulating  from  13.13  and  13.14,  every  set  is  w-Suslin,  and 
to  show  that  every  £(  set  *s  tt>i-Suslin  it  sufficed  to  show  that  every  n[  set  is 
a>i-Suslin.  But  if  A C k(toa>)  is  Ft},  then  there  is  a tree  T on  ka>  x a>  such  that  for 
any  w e *"’("&>), 

A(w)  **  w ^ p[T\ 

**  rw  is  well-founded 

**  3g(g:  Tw  ->  an  is  an  order-preserving  map) 

**  w e p[T ] , 

where  T is  a tree  defined  on  ka>  x a>\  from  T as  in  the  proof  of  13.14.  Thus,  a tree 
representation  for  Lj  sets  was  converted  into  one  for  IlJ  sets  by  appealing  to  the 
equivalence  between  the  nfF  property  of  well-foundedness  and  the  HfF  property 
of  having  an  order-preserving  map  and  formulating  a new  tree  of  approximations 
to  such  a map. 

The  tree  representation  for  sets  is  to  be  similarly  “dualized”  to  get  one  for 
II2  sets  and  hence  for  l.'  sets.  For  this  purpose,  a more  specific  representation 
for  £,  sets  is  firsl:  derived: 

Suppose  that  B C k{a,(D)  is  £3.  Taking  k — 1 for  simplicity,  let  T be  a tree 
on  2u>  x a>  such  that  for  any  v e wa>, 

B(x)  **  3 1 y (7)^,2)  is  well-founded) . 
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T can  be  defined  from  a relation  recursive  in  a real  (13.1),  and  by  slightly  altering 
that  relation  if  necessary  it  can  be  assumed  that  (0,0,0)  G T. 

In  terms  of  the  fixed  recursive  enumeration  (s,  | i g a>)  of  <wa>  such  that 
|s/|  < i,  for  any  x,y  ewco  let  <x^y  be  that  strict  linear  ordering  of  co  defined  by: 

i <a'.v  j s i T(x,y)  A 5/  ^ 7"{a.v)  a / < j : else 

S;  ^ T(Xiy)  a s j e T(x,y)  ; else 

Si  €=  T{x,y)  A S j G T(x,y ) A S;  <KB  S j 

where  <kb  is  the  Kleene-Brouwer  ordering,  defined  before  13.2.  Thus,  those  i’s 
so  that  Sj  is  not  in  T(Xyy)  appear  first  in  their  indexing  ordering,  followed  by  those 
i’s  so  that  Sj  is  in  T{x,y)  ordered  according  to  <Kb-  Note  that  since  (0,  0,0)  e T 
and  so  = 0 (as  |s,-|  < i),  0 is  the  maximum  of  <Xyy  by  the  definition  of  <kb-  The 
following  is  a consequence  of  13.2. 

15.1  Lemma.  For  any  x G “&>,  B(x)  **  31y(<x,v  is  a well-ordering).  H 

For  s,  t G <u>co  with  |s|  = |t|,  temporarily  setting 

T(Sit)  = {w  | 3m  < |s|((,s|m,  t\m,  w)  G T)} 

let 

<SJ  = that  strict  linear  ordering  of  |s|  = {/  | i < |s|} 

defined  like  <a,v  but  with  T(x,y)  replaced  by  fsy  . 

For  |s|  > 0,  note  that  0 is  again  the  maximum  of  <s  t.  If  s,  t,~s,l  G <coa>  with 
|s|  = \t\  and  for  some  m < |s|,  s = s\m  and  t — t\m,  then  S;  G iff  s,  G T,-y 
for  i < m since  |s,  | < i.  Consequently,  <Sy,  C <-j  as  relations.  Similarly,  for 
x,  y G wco 

i Ax,  v j [ff  i <x|m,y|m  j 
for  any  m e co  such  that  i,  j < m,  and  so 

(*)  <x,y  = U meal  <x\m,y\m  ■ 

In  terms  of  a regular  k > to  be  further  specified  later,  a tree  T*  on  2co  x k 
is  now  defined  by 

T*  = {(s,  t,  u)  G \JmeJ"'co  x mco  x "V)  | Vi,  j < M(w(Q  < u(j)  i <SJ  j))  . 

T*  is  like  the  T defined  in  the  proof  of  13. 14  with  k = co\,  except  that  the  linearity 
of  <s  t affords  the  possibility  of  iff.  The  following  is  a consequence  of  15.1  and 
(*)• 

15.2  Lemma.  For  any  x G “&>,  B(x ) **  31y3g  G ®k((x,  y,  g)  G [T*]).  H 

Toward  a tree  representation  for  ILj  sets,  suppose  that  A c k(‘°co)  is  such  a 
set.  Again  taking  k = 1,  A can  be  considered  to  be  the  complement  of  B above 
so  that  for  any  x G wa>, 
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A(x)  **  V1  yWg  € <u/f3°m((x|m,  y\m,  g\m)  T*) 

**  T*  is  well-founded  . 

To  convert  as  in  the  n|  case,  note  that  this  in  turn  is  equivalent  (in  ZF)  to  the 
existence  of  an  order-preserving  map:  T*  — »■  k+  by  12.9(a),  as  T*  is  injectible 
into  k.  A nicety  for  now  but  of  consequence  later,  the  k+  here  can  in  effect  be 
replaced  by  k: 

First,  in  the  definition  of  B the  maximum  of  <SJ  is  0 by  arrangement  so 
that  if  (s,  t,  u ) e T*  with  |s|  > 0 then  u( 0)  = max(ran(M)).  Next,  define  a strict 
ordering  <KB2  on  T*(—  {{ t , u)  | 3°m((x|m,  f,  u)  e T*)})  as  follows: 

{ t , u)  <kb2  (7,  k)  iff  t — u = 0 A |7|  = \u\  > 0 ; else 
t d 1 A u D u ; else 
3 m < |f|(t|m  = t\m  A u\m  — u\m  A 
(u(m)  <ii(m)  V (u(m)  — u(m)  A t(m)  <t(m))))  . 

Thus,  (0,  0)  is  the  minimum,  and  the  other  (f,  u)  are  ordered  according  to  an 
antilexicographical  version  of  <kb-  If  (f,  u)  <kb2  ( t , u)  then  either  u — 0 or  else 
u( 0)  < 77(0),  and  since  m(0)  = max(ran(w))  as  noted  above,  it  follows  that  the  set 
of  predecessors  of  (7,  u)  has  cardinality  \<c°co  x <tt>(77(0)  -F  1)|  < k.  On  the  other 
hand,  {((0),  (§))  | $ < k}  is  a subset  of  T*  of  ordertype  k under  <kb2-  Hence, 
by  arguments  as  for  <kb, 

T*  is  well-founded  iff  T*  is  well-ordered  by  <KB2  in  ordertype  k . 

(0,  0)  was  made  the  minimum  of  <^2  to  render  the  ordertype  k (rather  than  k + 1), 
whereas  the  well-foundedness  of  T*  is  with  respect  to  D*  (as  described  before 
12.9)  for  which  (0,  0)  is  the  maximum.  Accepting  this,  we  have  the  following: 

15.3  Lemma.  For  any  x e 01  a>, 

A{x)  -<->  3/(/:  T*  — {(0,  0)}  — ► k is  an  order-preserving  map)  . H 


This  is  recast  (cf.  13.14)  in  terms  of  getting  an  infinite  branch  through  a tree: 
For  .S',  t € <a>co  with  |s|  = \t\,  set 

T*t  = {u  | (5,  t,  u)  e T*}  . 

Then  stipulate  for  s e <Mco  that 

{hi  | i < |s|)  is  orderly  for  s iff  for  i,  j < |s|  , 

(a)  hi '■  Cls,|.s,  ^ K ■ and 

(b)  if  s,-  DSj  /0  and  u e 7S*|S.|_S;,  then  /?,  («)  < hj  (u  \ \ s;- 1 ) . 

((b)  imposes  no  conditions  on  ho'.  {0}  — > k,  which  corresponds  to  (0,  0)  e T*\  it 
is  only  maintained  for  notational  convenience.)  Now  let 
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T+  = {(5,  (hi  | i < |s|»  | s e <aco  A ( h ,•  | i < |s|)  is  orderly  for  5}  . 

The  following  is  then  straightforward. 

15.4  Lemma.  For  any  x € “&>,  A(x)  **  ■3hV°m((x\m,  h\m)  e T+).  H 

This  is  a sort  of  Suslin  property,  but  one  with  ordinal-valued  functions  /?,• 
composing  the  tree  rather  than  ordinals.  The  final  step  involves  ranking  the  hj’s 
in  a coherent  way,  and  the  crucial  idea  is  to  assign  to  the  hi ’s  ordinals  that  they 
represent  in  well-founded  ultrapowers. 

Note  first  that  the  tree  T*  has  the  following  homogeneity  property:  For  s,  t e 
<toa>  with  |s|  = \t\,  <s,t  is  a strict  linear  ordering,  and  so  for  any  u e T*t, 
ran(w)  e [/r]|s|.  Moreover,  for  any  w e [/c]|s|  there  is  a unique  u e T*t  with 
ran(w)  = w,  the  one  that  orders  the  members  of  w to  satisfy  the  definition  of  T*. 
Hence,  the  range  function  ran:  T*t  ->  [/c]bl  is  a bijection. 

k is  now  specified  to  be  a measurable  cardinal,  and  the  homogeneity  of  T*  is 
used  to  define  ultrafilters  over  T*t  as  follows:  Let  U be  a normal  ultrafilter  over 
k.  For  .s',  t € <a)a>  with  |s|  = \t\,  define  USJ  by: 

X € Us<t  iff  X c T*t  A 3 He  U([H]M  c ran“Z)  . 

By  Rowbottom’s  result  7.17,  for  any  n e co  and  Y c [at]"  there  is  an  H e U such 
that  either  [H]n  c Y or  else  [H]n  fl  Y = 0.  It  follows  that  USJ  is  a K-complete 
ultrafilter  over  Tf . (Here,  the  principal  ultrafilter  f/0j0  = {{0}}  is  being  allowed 
for  notational  convenience,  contravening  a convention.)  Let 

js.t:  V < M,j  = Ult(V,  USJ) 

(so  that  j0  0 is  the  identity  and  M0-0  = V ).  For  each  (s,  (hi  \ i < |i|))  e T+,  hj 
is  to  be  replaced  by  the  ordinal  [hf  that  it  represents  in  the  ultrapower  MS||S.|iS. . 
Since  [hj]  < js\\Si\,Si(K)  by  condition  (a)  of  orderliness,  the  result  will  be  a tree  on 
a>  x Yu,  where 


Yu  = sup ({js,,(K)  | s,t  e <coa>  A |s|  = |f|})  . 

(Yu  has  a simple  description  in  terms  of  U,  but  this  will  not  be  needed.) 

Toward  expressing  the  condition  (b)  of  orderliness  in  terms  of  ordinals,  note 
the  following  coherence  property:  If  s,  t,s,t  e <wco  with  |T|  = \t\  = n say  and 
s — s\m  and  t — t\m  for  some  m < n,  then  for  X C Tft, 

X e U,,t  iff  [u  e r | u\m  e X}  e U-sJ 

since  for  H e U,  [H]'n  c ran“X  iff  [ H ]"  c [w  e [v]"  | [w]m  c ran“X}.  As  is 
readily  checked,  there  is  a corresponding  elementary  embedding 

j(s,t),(s,i)'  Ms,t  A A7-j 

defined  by 
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jM.M  (LfhJ  = [(mm)  | u e T;-)]u_:t 

satisfying  jist\isj\  o jSi,  = js~t.  Condition  (b)  of  orderliness  can  now  be  suitably 
rendered,  recasting  T+  as  the  following  tree  onwx  yv: 

T = {(s,  (Si  | i < |i|»  e U meJmv  x ’"Yu)  I 

j < M(S/  D Sj  ^ 0 — > Sj  < 7(j||Sj|,Sj),(s||S;,Sj)(^/))!  ■ 

The  coherence  property  of  the  Usf s provides  a real  Suslin  property: 

15.5  Proposition.  A = p[f]. 

Proof.  The  characterization  15.4  of  A in  terms  of  branches  through  T+  and  how 
T was  defined  from  T+  readily  implies  that  A c p[T], 

Conversely,  suppose  that  x e p[T],  say  with  (x,  g)  e [f].  For  i e co  let 
hp.  7^,  , s.  ->  On  be  such  that  g(i)  = [hi]  in  MV||S.|jSi.  Then  by  the  definition  of 
T,  for  Si  D Sj  / 0, 

{u  € rc||S(|is.  | hj(u ) < hj (u | |Sy |)}  G U x 1 1 s, | , s,  ? 
and  so  there  is  an  Xij  e U such  that 

(t)  ran(w)  e [Z,7]|s-'  hfu)  < hj(u\\Sj\)  . 

Set  Y = Pi { Xij  | i,  j e co  a s,  d s j f 0}  e U. 

Assume  now  to  the  contrary  that  -'A(x).  Then  going  back  to  the  earlier 
juncture  15.1  in  terms  of  B,  there  is  a y G wco  such  that  <x<y  is  a well-ordering. 
As  Y is  uncountable,  there  is  a function  d:  co  ->  Y such  that  for  i,  j e to, 

d(i)  < d(  j)  iff  i <x,y  j . 

This  implies  that  for  every  m G co,  d\m  G Tfn  y^m,  and  hence  by  (t)  that  whenever 
y D Si  D Sj  f 0,  hi(d\\Si\)  < hj(d\\Sj\).  This  leads  to  an  infinite  descending 
sequence  of  ordinals  and  thus  a contradiction.  H 

The  foregoing  analysis  will  soon  be  adapted  to  a hypothesis  weaker  than 
measurability,  but  we  draw  on  its  strength  to  derive  an  interesting  and  useful 
conclusion  based  on  the  definability  of  T.  This  result  extends  Absoluteness 
13.15,  and  applicable  is  the  discussion  before  it  of  the  relativization  of  analytical 
relations  in  terms  of  their  normal  forms  and  the  corresponding  formulation  of 
absoluteness. 

15.6  Theorem  (Martin-Solovay  [69:  152]).  Suppose  that  A is  the  Tlj  set  as  above 
with  associated  tree  T defined  from  the  normal  ultrafilter  U over  k,  and  P is  a 
p.o.  satisfying  \ P\  < k.  Then 

3 xxA(x)  iff  |p  3lxA(x)  . 


15.  Sharps  and  Sets 


197 


Hence,  if  there  is  a measurable  cardinal,  £3  relations  are  absolute  for  V in  any 
generic  extension  via  a p.o.  P of  cardinality  less  than  that  cardinal. 

Proof.  By  the  argument  for  the  Levy-Solovay  result  10.15, 
lb  U = {T  c K | 3Z  e U(X  c 7)} 

is  a ^-complete  ultrafilter  over  k.  Moreover,  as  for  (*)  in  that  argument, 
but  incorporating  Rowbottom's  7.17,  we  have  the  following:  If  n e w and 
p |b  f:  [*]"  ->  V,  then  there  is  a q < p,  a Z e U,  and  an  /:  [Z]"  ->  V 
such  that  q |b  f |[Z]  = /. 

Suppose  now  that  G is  P-generic.  Then  it  is  straightforward  to  check  with 
the  property  just  established  that  for  s,t  e <®(u  with  |sj  = \t\,  if  jGt  is  defined  in 
V[G]  from  UG  just  as  jSJ  had  been  defined  in  V from  U,  then  jGt  and  jSJ  agree 
on  the  ordinals.  Consequently,  if  Tc  is  defined  in  V[G]  as  T had  been  defined 
in  V,  then  Tc  — T. 

With  this  absoluteness  property  the  argument  can  now  be  concluded  as  for 
13.15:  If  in  V there  is  an  x e Ww  such  that  A(x),  then  x e p[T]  c (p[P])v[G]  and 
so  A'/[c](.r).  Conversely,  if  in  V there  is  no  x e 01  u>  such  that  A(x),  then  [P]  = 0, 
i.e.  T is  well-founded  and  so  there  is  an  order-preserving  map:  T — > On.  Such 
a map  also  exists  in  V[G],  and  so  [W] = 0,  i.e.  in  V[G]  there  is  no  x e wco 
such  that  Av[G](x).  H 

Hence,  if  there  is  a proper  class  of  measurable  cardinals,  then  L3  relations 
are  absolute  for  V in  any  generic  extension.  Generic  absoluteness  would  become 
an  important  consideration  in  the  study  of  strong  hypotheses. 

Another  analysis  of  Lj  sets  is  provided  next,  one  not  needed  for  present 
purposes  but  through  which  15.5  could  have  been  established.  This  approach  was 
to  be  the  basis  of  a generalization  through  the  projective  hierarchy  using  strong 
hypotheses  (see  32.6  about  weakly  homogeneous  trees). 

For  x,  y e wu>  it  is  simple  to  check  that 


{{A^x\m,y\m  I ^ ^ to) , (j {x\m,y\m),{x\n,y\n)  I ^ n)) 


is  a directed  system  (§0),  so  stipulate  that 

{Mx  y,  EXty)  is  the  direct  limit,  and 

kx\m,y\m  • x\m,y\mi  ^ ^ x . y > Ex  f) 

the  corresponding  embeddings  so  that  for  m < n, 


kx\n,y\n  ° j (x\m,y\m),(x\n,y\n)  — Xx\m,y\m 


In  what  follows,  B is  the  representative  set  °f  which  we  have  been  viewing  A 
as  the  complement. 
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15.7  Exercise. 

(a)  For  any  x,  y e “‘co,  {MXty,  Ex  y)  is  well-founded  iff 

(*)  whenever  Xm  e Ux \m,y\m  for  m e co  there  is  a 

d e “ k such  that  d\m  e Xm  for  every  m e co. 

(b)  For  any  x e “‘co, 

B{x)  **  31y{{MXty,  Ex  y)  is  well-founded)  . 

Hint,  (a)  Suppose  that  (*)  holds,  and  assume  to  the  contrary  that  (M(J,  Ex<y)  is 
ill-founded,  i.e.  there  are  [zi  I i e of  c Mxy  such  that  Zi+i  Exy  Zi  for  every  i e co. 
For  each  such  i,  let  m,  e co  and  /;, : T*^m.  y^m,  — »■  V be  such  that  kx\mjyy\mi([hj])  — 
Zi,  with  m j < mI+i.  It  follows  that 

[^/+l]  ^ j(x\mi,y\mi),(x\mi+i,y\mi+i)  ] fo^*  every  i € CO  . 

This  is  similar  to  a situation  in  the  argument  for  15.5,  so  invoking  (*)  get  an 
appropriate  d e 10 k as  there  to  get  a contradiction. 

Suppose  next  that  (*)  fails,  i.e.  there  are  X„,  e Ux \m,y\m  for  m e co  with  no 
d emK  such  that  d\m  e Xm  for  every  such  m.  Set 

W = {w  e <0>k  | Vm  < \w\(w\m  e X,,,)}  . 

Then  (W,  D)  is  a well-founded  tree  on  k and  so  has  a rank  function  pw  (see 
before  12.9).  By  the  coherence  property  of  the  USt,’s,  W fi  Xm  e \JX \m<y\m  for 
every  m e co.  Now  show  that  for  such  m, 

fx\m+\,y\m+\  ([PW  I (Ik  Ft  2fm^_i)])  Exy  kx^m^  y|m  ([p^/ 1 ( W Ft  2fm)]) 

to  confirm  that  Mx  y is  ill-founded. 

(b)  If  there  is  a y e wco  such  that  (Mx,y,  EXiy)  is  well-founded,  then  by  (*) 
there  is  a d e wk  such  that  (x,  y,  d)  e [T*],  so  that  B(x).  For  the  converse,  argue 
as  for  15.5  to  establish  (*)  for  a y e wco  such  that  <*iy  is  a well-ordering  by 
finding  a d:  co  ->  Y for  an  appropriate  Y e U.  ~\ 

The  Analysis  with  Sharps 

Having  developed  a tree  representation  for  Ilj  (and  hence  ) sets  with  the  broad 
strokes  afforded  by  a measurable  cardinal,  the  analysis  is  now  refined  to  establish 
results  in  ZF  assuming  only 

Yu  e “ co  (a # exists)  . 

This  hypothesis  was  discussed  at  the  end  of  §14,  and  the  results  established  there 
will  soon  be  applied.  In  effect,  the  approach  of  Mansfield  [71]  is  incorporated 
into  the  original  Martin-Solovay  [69]  context,  as  was  done  by  Martin. 

Following  the  previous  theory  for  our  representative  Ilj  set  A up  to  the 
formulation  of  T+,  the  first  task  is  to  replace  the  normal  ultrafilter  U by  an 
adequate  analogue.  For  this  purpose  we  simply  take 


K = CO\ 
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and  the  closed  unbounded  filter  Cm  over  oy  restricted  to  a suitable  class.  Recalling 
that  for  a e “’co,  L[a ] — L[{(n,  a(n))  \ n e co}]  as  stipulated  in  §14,  set 

L = LL^M . 

the  class  of  sets  constructible  from  some  real.  L C L(R)  = L(0>oj),  and  the 
equality  L — L(Rj  holds  iff  M.  e L[a ] for  some  a e wa>  iff  L is  an  inner  model. 
Z serves  as  a convenient  class,  rich  enough  for  the  full  play  of  the  reals,  but 
restricted  enough  to  be  constrained  by  the  sharps. 

15.8  Lemma.  For  any  n e a>  and  Y e 'P  ( [o>  ] ]" ) fl  L,  there  is  an  H closed 
unbounded  in  co\  such  that  either  [//]”  c Y or  else  [H]n  fl  Y = 0. 

Proof  Suppose  that  Y e T(\co\f)  fl  L[a ] where  a e “co.  Then 
Y = t{LM'e'a) (xu  ■ ■ ■ , xm,  yu  ■ ■ ■ , yP) 

for  some  Skolem  term  t and  x\  < ...  < xm  < y\  < . . . < yp  all  in  Ia , the  class 
of  a#  indiscemibles,  with  xm  < cn\  < y±.  If  H = Ia  fl  (co\  — ( xm  + 1)),  then 
H is  closed  unbounded  in  ay , and  either  [H]n  C Y or  else  [H]"  (IF  = 0 by 
indiscemibility.  H 

With  this,  Ca H fl  L can  be  used  in  place  of  U to  define  analogues  USJ  to  USJ 
for  s,  t e <mu>  with  |s|  = \t\  by 

XeUsj  iff  XeV(T*,)nL  a (3H  e Cm  n L)([//]|s|  c ran“X)  . 

([/0,0  = {{0}}  is  allowed  as  before.)  Note  that  T*t  e L,  since  it  is  simple  to 
check  that  T*  e L[b]  c.  L for  any  b e 01  co  such  that  A is  n\(b)  (cf.  13.14;  as 
there,  oo\  may  differ  from  co\^).  It  follows  by  15.8  that  U SJ  is  an  co\-complete 
ultrafilter  on  V(T*t)  fl  L.  (In  our  approach  through  a motivating  analogy,  some 
use  of  AC  is  needed  to  choose  countably  many  7/’s  in  Cm  fl  L here,  and  to  get 
an  order-preserving  map  into  On  from  well-foundedness  below.  DC  suffices,  but 
as  we  note  before  the  summarizing  result  15.12,  all  uses  of  choice  principles  can 
be  eliminated  from  its  proof.) 

As  before  there  is  a coherence  property:  if  s,  t,s,t  e <mco  with  |T|  = \t\  and 
s — s\m  and  t = t\m  for  some  m < |s|,  then  for  X e V ( Tf ) fl  Z, 

X e U,,t  iff  {u  e T*7  \ u\m  e X}  e U-j  . 

What  about  the  analogues  for  the  maps  Note  that  only  their  restric- 

tions to  On,  indeed  to  yu,  had  been  needed.  So,  first  consider  for  s.t  e <mco  with 
|s|  = |r|  the  ultrapower  of  (On,  <)  by  USJ  using  only  functions:  T*t  — »■  On  that 
are  members  of  Z: 
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For  such  a function  /,  set 

(f)uaJ  = {geL\g : T*t  ->  On  a {u  e T*t  | /(«)  = g(u)}  e Us,t]  . 

(Note  that  if  / e i[flo]  and  g e L[a\\  for  some  ciq,  a\  e ®<y,  then  {/,  g}  c L[a] 
for  any  a e “’co  coding  ao  and  a\,  so  that  {u  e T*t  \ f(u ) = g(u) } e L[ci\  c L.) 
Now  define  (/)5_  as  in  §5  to  consist  of  those  members  of  (f)jj  of  minimal 

rank,  so  that  (/)5_  is  a set.  The  domain  of  the  ultrapower  can  then  be  formulated 
as 

r"On /USJ  = {(/)!  ( | /:  r*f  — >■  On  A / e Z} 
with  its  ordering  <jj  defined  by 

( <ust  iff  {«  e T*t  | /(«)  < *(u)}  e . 

(Note  that  [u  e r*f  | f(u ) < #(«)}  e L as  before.)  Since  USJ  is  &>i-complete, 
this  ultrapower  is  well-ordered  (cf.  5.3),  and  hence  there  is  a unique  isomorphism 

\ (r'*'On /Us,,,  <ysi)  (On,  <)  . 

As  in  §5,  set 

There  is  a corresponding  ultrapower  embedding  j s t:  On  ->  On  given  by  j = 
[/„]j 7 , where  fa  is  the  constant  function:  T*t  ->  {a}. 

For  s,  t,s,t  e <0)co  with  |s|  = \t\  and  s — s\m  and  t = t\m  for  some  m < |s|, 
define  analogues 

On->  On 

t0  ) by; 


jM,^)(inuJ  = [</(«l»0  I « e . 

The  coherence  property  for  the  ultrafilters  insures  that  these  maps  are  order- 
preserving. 

From  the  tree  T+,  a new  tree  analogous  to  the  former  7 is  now  defined. 
This  time,  for  (s,  {hi  \ i < |s|))  e T+,  hi  is  to  be  replaced  by  the  ordinal  [hi] 
that  it  represents  in  the  ultrapower  of  On  by  Us ||Sf|,s;-  Since  [/?,]  < jsus.\t s.(«i)  by 
condition  (a)  of  orderliness,  the  result  will  be  a tree  on  a>  x y,  where 

7 = sup({/s  ,(a>i)  | s,t  e <<o0)  a |s|  = |/|})  . 

Important  conclusions  will  soon  be  drawn  from  a detailed  analysis  of  y,  but  for 
now  the  condition  (b)  of  orderliness  is  cast  in  the  formulation  of  the  new  tree, 
which  again  will  be  called  T : 
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f = {{s,  (5,-  I i < |s|»  € LUX"  X mY)  I 

Vi,  j < Is  I (s,-  3^0  ->  5,-  < 7(J||Sj.|,Sj.>,{s||Si,Si)(^))} 

The  analogue  of  15.5  holds: 

15.9  Exercise.  A — p[T ] (with  the  current  definition  ofT). 

Hint.  To  show  that  A c p[T],  suppose  that  x e wco  and  A(x),  i.e.  T*  is  well- 

founded.  T*  e L[b]  for  b e “co  such  that  A is  n\(b),  so  if  c e “co  codes  b 

and  x,  then  T*  e L[c].  Arguing  as  for  L Absoluteness  13.15,  T*  is  also  well- 
founded  in  L[c],  and  so  applying  15.3  in  L [c]  there  is  an  order-preserving  map 
/:  T*  — {(0,  0>}  —¥■  co i with  / e L[c].  Such  a map  can  be  elaborated  as  a set 

{hi  | i e co}  c L[c]  c L such  that  (x|m,  {ho, . . . , hm-\))  is  orderly  for  x|m  (as 

in  the  definition  of  T+)  for  every  m e co.  Consequently,  if  g:  co  ->  y is  defined 
by  g(i ) = [/;,],  the  equivalence  class  in  the  sense  of  £/x||s,|.s;,  then  (x,  g)  e [Z]  to 
confirm  that  x e p[T\. 

The  converse  is  just  as  for  15.5.  H 

Whenever  there  are  Suslin  representations  for  the  fZ,  sets,  there  are  corre- 
sponding representations  for  the  £3  sets,  and  some  observations  can  be  made  about 
their  absoluteness  and  regularity  properties.  Suppose  that  for  x e “ co , 

C(x ) **  31yD(v,  y) 

where  D C 2(m®)  is  77-,1  ( a ) for  some  a e co,  and  there  is  a tree  T°  on  2co  x k for 
some  k such  that  D = p[TD].  Then 

C (x ) T®  is  ill-founded  . 

Recall  the  discussion  before  13.15  about  the  relativization  of  analytical  relations 
in  terms  of  their  normal  forms  and  the  corresponding  formulation  of  absoluteness. 

15.10  Exercise  (Mansfield  [71]).  Suppose  that  M is  an  inner  model  with  {a,  T°}  C 
M.  Then: 

(a)  C is  absolute  for  M. 

(b)  If  “co  fl  M is  countable,  then  C is  Lebesgue  measurable,  has  the  Baire 
Property,  and  has  the  perfect  set  property. 

Hint,  (a)  For  any  x e “co  C\  M, 

C(x)  **  T()  is  ill-founded 

(Txd  is  ill-founded)" 

**  Gy  e “co  fl  M){T^y^  is  ill-founded)" 

**  Gy  e “co  fl  M)(T»y)  is  ill-founded) 

(3y  e “co  fl  M)D{x,  y) 

**  (3y  e “co  fl  M)Dm(x,  y)  (by  Absoluteness) 

CM(x)  . 
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(b)  For  the  first  two  properties,  note  that  by  (a), 

C = j.r  e | M[x]  |=  <p[x]} 

for  some  formula  p,  and  use  Solovay’s  arguments  for  14.1  and  14.2.  For  the  third 
property,  apply  14.7.  H 

The  significance  of  the  tree  T on  w x y lies  not  only  in  such  properties, 
but  also  in  the  possibility  afforded  by  the  sharps  of  establishing  a contextually 
optimal  Suslin  property  low  in  the  cumulative  hierarchy.  This  is  a consequence  of 
a calculation  of  y in  terms  of  the  class  {m^  | 0 < £ € On}  of  uniform  indiscemibles 
discussed  at  the  end  of  §14.  The  results  established  there  lead  to  the  following 
connection  between  the  indiscemibles  and  the  ultrapower  embeddings  that  figured 
in  the  formulation  of  T : 

15.11  Lemma.  Suppose  that  s,  t e <wco  with  |s|  = \t\.  Then 

js,t (®i)  = «|j|+i  • 


Proof.  Since  js  t(co\)  is 

mVsj  | h:  T*t  ->  Wl  A h e L]  , 

we  must  show  that  the  uniform  indiscernible  u\s\+\  coincides  with  this  set  of 
ordinals. 

Suppose  that  h:  T*t  ->  u>\  with  h e L.  Then  for  some  a e a>w,  h e L[a ], 
and  moreover  by  the  proof  of  GCH  in  L[ci],  h is  the  yth  element  in  the  canonical 
well-ordering  of  L[a]  (3.3(b))  for  some  y < U2  (as  u>\  — u\  < M2).  Using  14.18(b) 
for  this  y and  coding  more  into  a if  necessary  (or  proceeding  directly  as  in  its 
proof),  it  follows  that 

h = t[L[al€’a)  {op) 

for  some  Skolem  term  t\. 

It  can  henceforth  be  assumed  that  |s|  > 0,  as  700(cui)  = &>i  = u\.  By 
definition  of  T*  there  is  a unique  permutation  tt\  |j|  ->  |j|  such  that  for  any 
to  < ■ ■ • < §1*1-1  < a>u  (§^<0)>  • ■ • - §jr(iq-i)>  e T*t.  A small  adjustment  of  tt  leads 
to  a new  Skolem  term  t such  that  for  any  § 0 < . . . < §|j|-i  < a>\, 

h((%i r(0).  • • • > §ff(k|-l)>)  = f<i[fl]'e'fl>(§0,  • • • , §|.s|-l-  o>i)  < ■ 

Cast  this  way,  h can  be  naturally  extended.  In  particular,  consider 

n(h , a , t ) = t{Lla]'e'a)(uu  M|sJ:,  M|s|+l)  < M|.S|+1  . 

(By  an  argument  used  in  the  proof  of  14.18(b),  r]{h.  a.  t)  = t ’ (mi,  . . . , M|s|) 
for  some  Skolem  term  1,  i.e.  the  dependence  on  M|s|+i  can  be  eliminated  at  the 
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cost  of  replacing  a by  a#.  However,  while  resonating  with  a surjectivity  argument 
at  the  end  of  this  proof,  this  further  transformation  is  not  needed  here.) 

First  observe  that  if  h e [/;]y  and  r](h,a,l)  is  derived  from  some  ci  and 

7 for  h , then  ?;(/?,  a,  t)  — r)(h,a,t ):  By  definition  of  USJ,  there  is  a C closed 
unbounded  in  a>\  such  that  for  §0  < . . . < ||S|_i  all  in  C,  h{{%n( 0),  ■ . ■ , ^jrflq-i)})  = 
/z({f 7r(0), . . ■ , §tt(|s|— i)>)-  Since  Ia  fl  fl  a>\  is  also  closed  unbounded  in  co\,  there 
are  such  £o  < ■ • • < §|,s|-i  all  in  Ia  fl  I„,  and  so 

t(L[a],e,a)  £|,|_l)  = h({% . . . , £jt(|s|-1))>  U>\) 

= M(§jr(0)>  • • • , £jr(|*|-l))) 

Hence,  if  c e <°co  codes  a and  a,  then  by  definability  and  indiscemibility  applied 
in  L[c], 

r](h,a,  t ) = t{L[a]'€’a) {u i, . . . , M|S|,  M|s|+i) 

— rj(h , a,  7)  . 

It  now  follows  that 

<?:  {[^];7i(  I h\  T*t  ->  wi  A h e L}  M|s|+i 

is  well-defined  by:  ) = ?7 (7i , a,  f)  for  some  a and  r for  h.  An  analogous 

argument  shows  that  e is  order-preserving.  Finally,  any  y < M|.s|+i  is  of  form 
f(L[a],€,a)(Mi,  _ ^ h^i)  for  some  a e mco  by  14.18(b),  so  it  follows  that  e is  surjec- 

tive. Hence,  e is  a bijection  (and  consequently  the  identity  map),  and  the  proof  is 
complete.  H 

That  the  map  e is  the  identity  implies  that  T could  have  been  formulated 
in  terms  of  a well-ordering  of  Skolem  terms  involving  the  w„’s  without  reference 
to  the  ultrafilters  U SJ.  Consequently,  the  foregoing  analysis  can  be  carried  out 
in  ZF  without  relying  on  a choice  principle  (DC)  to  ensure  well-foundedness 
of  ultrapowers  (cf.  Moschovakis  [80:  596ff]).  On  the  other  hand,  the  ultrafilter 
formulation  motivates  T by  embedding  it  into  a larger  context. 

It  follows  from  15.11  that 

Y = sup({;'Si?(tt>i)  \ s,  t e <mu>  a \s\  = |f|}) 

= sup({M„  | 0 < n e &>}) 

= m®  , 

and  this  leads  to  the  following  result,  incorporating  the  previous  remark  about 
provability  in  ZF. 


15.12  Theorem  (ZF)(Martin).  Assume  that  V 'a  e "o>(a#  exists).  Then  eveiy  £3  set 
is  u0,-Suslin.  H 
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The  Axiom  of  Determinacy  actually  implies  that  u„  — &>„  for  every  n < co 
(28.9).  In  the  ZFC  context  we  have  the  following  remarkable  structure  result  for 
£/  sets: 

15.13  Corollary  (Martin).  Assume  that  da  e wco(a#  exists).  Then  every  L/  set  is 
a>2-Suslin,  and  hence  a union  of  K2  Borel  sets. 

Proof.  By  14.19,  u,„  < 0)3.  The  latter  assertion  follows  from  13.13(f).  H 

This  is  analogous  to  the  classical  result  13.7,  and  implies  under  the  existence 
of  sharps  that  every  £/  set  has  cardinality  at  most  K2  or  a perfect  subset  and  hence 
cardinality  2K°. 

The  definability  properties  of  T also  lead  to  a uniformization  result  for  IT ' 
sets.  As  discussed  at  the  end  of  §13,  every  Y.\  set  can  be  uniformized  by  a £3 
set,  and  but  it  is  consistent  that  there  is  a 77/  set  that  cannot  be  uniformized  by 
any  projective  set.  The  tree  analysis  of  Mansfield  [71]  together  with  the  sharps 
analysis  of  Martin-Solovay  [69]  leads  to  the  following: 

15.14  Theorem  (Mansfield  [71],  Martin-Solovay  [69]).  Assume  that  da  e 
l0a>(a#  exists).  Then  for  any  b e ® co,  every  n^ib)  set  can  be  uniformized  by  a 
77/  ( b ) set. 

Proof.  The  structure  of  the  proof  is  as  for  the  n[  Uniformization  13.17,  which 
established:  If  A c 2(ma>)  is  77/  (a),  then  there  is  a 77/  (a)  set  Aq  c A such  that  for 
any  x e ® co , if  there  is  a y such  that  A(x,y)  then  there  is  exactly  one  y such  that 
A0(x,  y).  Availing  ourselves  of  our  representative  Tl]  set  A C 10 co  and  the  theory 
developed  for  it  through  15.9,  we  shall  establish  the  following  “basis”  theorem: 
For  b e 01  co  such  that  A is  77/  (b),  there  is  a n^(b)  set  Aq  c A such  that  if  there 
is  a y such  that  A(y),  then  there  is  exactly  one  y such  that  Ao(y).  With  only 
notational  changes  A and  Aq  can  be  recast  as  subsets  of  2(wco)  and  a parametrized 
version  of  the  proof  can  be  given  to  establish  the  usual  uniformization  assertion. 

Let  T*  on  2co  x co\  and  T on  co  x y be  as  before  so  that  for  any  y e “’co, 

A(y)  **  T*  is  well-founded 

3gewy((y,g)e[f]). 

First  observe  that  for  any  y e wco  if  there  is  a g such  that  (y,  g)  e [f ] 
at  all,  then  there  is  an  honest  leftmost  one,  i.e.  a gy  such  that  for  any  other 
g,  gy(i)  < gd)  for  every  i e to: 

Suppose  that  A(y),  so  that  T*  is  well-founded.  By  12.9(a)  and  15.3  the 
restriction  of  its  rank  function  pr»  to  T*  — {(0,0)}  is  an  order-preserving  map  into 
a>  1.  Moreover,  by  the  relativization  argument  for  15.9,  for  any  ce“w  that  codes  b 
and  v,  Pt * e L[c]  c L.  Consequently,  pT*  can  be  elaborated  as  {h\  \ i e co]  c L 
where  h]\  T* j|s,|_s,  — > co \ is  specified  by  h?(u)  — pj*({Si,u))  for  i 0 and 
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hyom  — 0.  Then  if  gy  e wy  is  defined  by  gv(i ) = the  equivalence  class  in 
the  sense  of  t/v||s,|,s,- - it  follows  that  {y,  gy ) e [f]. 

Suppose  now  that  ( y , g)  e [f]  with  g arbitrary.  For  i e co  let  /?,■:  7)*^  s,  -> 
On  be  such  that  g(i)  — [hf,  the  equivalence  class  in  the  sense  of  Uy ||Sj|,Si.  Then 
for  Si  D Sj  f 0, 

{ll  € T’ylls.-I.s,-  I hj(u)  < /ij(M||Sj|)}  € f/y||Sj|,Si  > 

and  so  there  is  a Ci;  closed  unbounded  in  u>\  such  that 

(t)  ran(M)  e [C,7]|Sil  hfu)  < hj(u\\Sj\)  . 

Set  C = Hf Cij  | i,  j e co  A s(-  D s j ^ 0},  a set  closed  unbounded  in  w\. 

Next,  let  e:  C ->  be  the  order-preserving  bijection.  Then  e induces  an 

isomorphism  between  T*  and  a tree  Tc  on  a>  x C defined  by 

(t,  u)  e Tc  iff  (t,eou)eT; 

since  the  order  relationships  for  the  second  coordinates  are  preserved.  As  A(y), 
both  T*  and  Tc  are  well-founded,  and  their  rank  functions  are  related  by 

pT<:{{t,  u })  = pT;((t , eou))  . 

Flence,  by  12.9(a)  and  how  the  hf  s make  up  an  order-preserving  map:  Tc  -a  On 
by  (t), 

Pt* ( (Sj , e o u))  < hj(u)  for  0 < i e co  and  u e |SilC  . 

Finally,  it  is  simple  to  see  that  there  is  a D c C closed  unbounded  in  co\  on 
which  e is  the  identity,  so  that 

Pt*  ({S/ , «))  < hfu)  for  0 < i e co  and  u e |Sll£>  . 

But  how  the  hyf  s were  defined  from  pT*  implies  that 

gyd)  = [h-]  < [hf  = g(i)  for  eve^  i e co  . 

Proceeding  now  as  in  the  argument  for  nj  Uniformization,  the  idea  is  to 
uniformize  by  choosing  that  y of  the  leftmost  (y,  g)  in  |T]  if  there  is  one.  With 
A being  n\(b),  that  this  can  be  done  in  a n^ib)  fashion  relies  on  the  observation 
that  g would  have  to  be  the  honest  leftmost  gy  as  described  above. 

Let  R-  and  R be  relations  on  o>  x 2 ('"&>)  given  by 

R~(i,y,z ) -’A(y)  v ^A(z)  v gy(i)  < gfi)  , and 

R<(i,y,z ) -’A(y)  v -^A(z)  v gy(i)  < gz(i)  . 

It  suffices  to  show  that  these  are  S\(b).  If  Aq  c A were  then  defined  following 
the  proof  for  13.17,  it  would  uniformize  as  desired  and  be  essentially  of  form 
n\  (b)  A V'fX'jCZ?)  -»  -’IT,1  (b)]  and  hence  TJ^b)  to  complete  the  proof. 
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To  show  that  R-  is  17'  (/;),  the  argument  for  R<  being  analogous,  note  first 
that  -'A  is  £i(b),  so  it  remains  to  show  assuming  A(y)  and  A(z ) that  gy(i)  < gz(i) 
is  S\{b): 

For  this  purpose,  recall  that  any  x e “co  codes  ((x)k  \ k e a>)  e “(“co) 
by  (x)k(m)  = x((k,m)),  and  let  a(y,z)  e “co  be  such  that  (a(y,z)) o = b, 
( a(y , z))i  = y,  (a(y,  z)) 2 = Z,  and  (a(y,  z))„  = the  constant  function:  co  — >■  {0} 
for  n > 2.  Now  by  the  characterization  of  the  honest  leftmost  gy,  for  i e co, 
gy(i)  — [hj],  the  equivalence  class  in  the  sense  of  t/v||S.|iS.,  where  in  turn 
hj:  T*llSjlSj  -»  co  1 is  given  by  = p7-.((s,,  u)).  Applying  a previous  re- 

mark, pT * e L[a(y,  z)]  as  a(y,  z)  codes  b and  y,  and  in  fact  a closer  look  shows 
that  Pt*  is  definable  in  (L[a(y,  z)],  e,  a{y,  z)),  via  the  same  formula  for  all  y and 
Z,  in  terms  of  the  one  parameter  needed  to  define  T*  there,  namely  (the  real)  co\ . 

We  next  incorporate  aspects  of  the  proof  of  15.11.  For  0 < i e co,  there  is 
a unique  permutation  zrIi3,:  |s,|  — »■  |s,  | such  that  for  any  $0  < ■ ■ ■ < f|s,|-i  < (0\, 
(%niy(0),  ■ ■ ■ ■ s,(  -ii)  £ TV*|S,|  S;.  It  follows  from  the  aforementioned  definability 

of  Pt * in  terms  of  u>\  that  for  0 < i e co  Skolem  terms  t,  can  be  defined  uniformly, 
not  depending  on  y or  z,  such  that 

b\  ((^7T,  v(0>7 fjTi.j.flSil-l)))  = Pr*«Si,  ($7TZy(0)  1 • • • , ^TT/.ydS,  1— n>>) 

= €My'Z))  (Ic...,^,-!,^). 


The  proof  of  15.11  now  shows  (through  its  mapping  e being  the  identity)  that 


_ t{L[a(y,z)].€,a(y,z)) 


m = c 


M|S,J,  M|s,-|+l) 


All  of  the  foregoing  applies,  of  course,  to  get  an  analogous  expression  for 
[h]  \ in  terms  of  some  f,-.  Consequently,  for  each  i e co  there  is  a formula  (p,  such 
that 

gy(i)  < gz(i)  ^ [hU  < [/If] 

(. L[a(y,z)],&,a{y,z ))  \=  <Pi[u  1, M|S,|+i] 

{(Pi(c0, . - - , CIS,.,)1  e a{x,y)# 

where  the  c,’s  are  constants  for  indiscernibles  in  the  §9  language  £*  for  the 
theory  of  sharps.  By  the  uniformity  of  the  definitions  it  is  straightforward  to 
see  that  the  function  taking  i to  • • • - cis.i)1  can  be  taken  to  be  recursive. 

Flence,  with  b coded  in  a(x,  y),  14.16  and  the  argument  for  14.12(c)  implies  that 
the  gy(i)  < gz(i)  is  H\(b)  (in  fact,  A\{b))  to  complete  the  proof.  H 


Adjoining  an  3 to  the  result  is  straightforward  (cf.  13.18). 

15.15  Exercise.  Assume  that  Va  e a>co(a#  exists).  Then  for  any  b e ® co,  eveiy 
27]  ( b ) set  can  be  uniformized  by  a Xjj  ( b ) set.  H 


Thus,  the  methods  and  results  of  classical  descriptive  set  theory  were  extended 
to  the  third  level  of  the  projective  hierarchy.  What  about  the  fourth  level? 
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Having  developed  a tree  representation  for  IlJ,  sets  and  hence  for  X,  sets, 
the  next  step  would  be  to  convert  the  latter  to  a tree  representation  for  II3  sets  by 
again  appealing  to  the  basic  equivalence  between  well-foundedness  and  having  an 
order-preserving  map  into  the  ordinals.  Suppose  that  E C is  II3,  say 

E(x)  **  -’3 yA(x,y) 

where  A c 2(“&>)  is  rU.  This  A can  be  regarded  as  a binary  version  of  our 
previous,  representative  A;  recalling  that  y — uw  (as  noted  before  15.12),  the  new 
A is  p[T]  for  a corresponding  T on  2co  x um.  Consequently, 

E(x)  -'3y3g  e wuMx,  y,  g)  e [f  ]) 

**  Tx  is  well-founded 

**  3 /(/:  Tx  -»  is  an  order-preserving  map)  . 

A tree  analogous  to  T*  could  now  be  developed  of  finite  approximations  to 
order-preserving  maps.  This  done,  certain  ordinal-valued  functions  would  have 
to  be  ranked,  requiring  cui-complete  ultrafilters  UStt  on  V{Xs  t)  for  some  sub- 
sets XS't  of  These  ultrafilters  would  again  have  to  satisfy  coherence  and 

well-foundedness  properties,  and  these  would  be  derived  most  directly  from  ho- 
mogeneity properties  of  T.  How  can  such  properties  be  established? 

Looking  further  into  the  construction  of  T,  if  ( s , {<5,-  | i < |s|}}  e T , there  is 
a sequence  (/;,■  | i < |s|)  orderly  for  s such  that  S,  = [/z , ] . Such  a sequence  in  turn 
corresponds  to  a function  on  Zs  = { (s,- , u)  \ 3 i < |s|((s||s,-|,  s,-,  u ))  e T*}  given  by 
h((Sj,  u))  — hj(u).  Moreover,  for  5^0,  Zs  is  seen  to  be  well-ordered  by  <Kb2  in 
ordertype  co \ by  remarks  before  15.3.  Thus,  Zs  can  be  identified  with  a member 
of  [wi]®1,  which  would  now  play  the  role  of  [cui]".  But  now,  we  would  need  a 
partition  property  for  [coi]®1  analogous  to  15.8,  i.e.  some  substantive  version  of 
the  full  a>\  — > (cO] )®' . Such  properties  contradict  AC,  but  actually  follow  from 
the  Axiom  of  Determinacy  (cf.  28.13),  and  moreover  the  ZFC  context  could  be 
maintained  by  only  assuming  that  axiom  in  the  inner  model  L(R),  which  could 
play  the  role  of  L. 

In  1971  Kunen  thus  proceeded  to  derive  a tree  representation  for  IT3  sets,  and 
forging  ahead,  similarly  comprehended  the  II4  sets.  This  work  was  to  stand  as 
the  high  point  of  the  contextually  optimal  analysis  of  the  projective  sets  for  well 
over  a decade,  further  progress  being  stymied  by  the  inability  to  gain  sufficient 
information  about  an  ordinal  known  as  <5j.  How  this  work  was  resoundingly 
completed  in  the  mid-1980’s  is  discussed  in  §30. 


Chapter  4 

Aspects  of  Measurability 


This  chapter  describes  the  wide-ranging  investigation  of  measurability  carried  out 
in  the  late  1960’s  with  the  complementary  methods  of  forcing  and  inner  models. 
These  developments  not  only  provided  a coherent  and  illuminating  structural  anal- 
ysis, but  suggested  new  questions  and  provided  paradigms  for  the  investigation 
of  stronger  hypotheses.  §16  and  §17  discuss  saturated  ideals,  a subject  brought 
to  the  forefront  by  work  of  Solovay.  The  first  section  presents  his  combinatorial 
analysis,  and  the  second,  his  consistency  result  on  real-valued  measurability  and 
then  the  cycle  of  Kunen-Paris  consistency  results  based  on  the  extendibility  of 
elementary  embeddings.  §18  describes  Prikry  forcing,  a notion  that  provided  yet 
another  understanding  of  measurability,  and  a generalization,  one  of  several  that 
were  to  become  crucial  to  the  careful  study  of  consistency  strength.  The  rest  of  the 
chapter  is  based  on  Kunen’s  work  on  iterated  ultrapowers,  inner  models  of  mea- 
surability, and  elementary  embeddings:  With  the  general  concept  of  M-ultrafilter, 
his  iterated  ultrapower  theory  is  developed  in  §19.  §20  applies  it  to  derive  the 
structure  theorems  of  Kunen  and  Silver  on  inner  models  of  measurability.  Finally, 
§21  gives  Kunen’s  result  that  the  existence  of  0#  is  equivalent  to  the  existence  of 
ay:  L < L,  and  a detailed  analysis  of  Solovay’ s 0^,  the  existence  of  which  is  a 
principle  of  transcendence  over  inner  models  of  measurability. 
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16.  Saturated  Ideals  I 

Addressing  another  aspect  of  Lebesgue  measure,  Solovay  derived  the  principal 
results  on  its  extendibility,  in  the  presence  of  AC,  to  a measure  on  all  sets  of 
reals.  This  is  real-valued  measurability  < 2K°,  the  other  branch  of  the  bifurcation 
found  by  Ulam  in  his  fundamental  paper  [30]  on  Banach’s  measure  problem  (§2). 

16.1  Theorem  (Solovay  [66,71]). 

(a)  If  k is  a measurable  cardinal,  then  there  is  a p.o.  P such  that 

k = 2k°  a k is  real-valued  measurable  . 

(b)  If  k is  a real-valued  measurable  cardinal,  then  there  is  an  inner  model  of 
ZFC  in  which  k is  measurable. 

(a)  is  established  in  the  next  section  (17.5);  the  original  proof  of  (b)  is  sketched 
in  the  setting  of  inner  models  of  measurability  (20.4);  and  a sharp  version  of  (b)  is 
established  much  later  (see  Precipitous  Ideals  in  volume  II).  Solovay’ s paper  [71] 
was  particularly  influential,  for  in  establishing  16.1  not  only  did  it  broaden  the 
study  of  large  cardinal  properties  from  ultrafilters  to  ideals,  but  it  also  described 
how  forcing  and  ultrapowers  can  be  combined  in  a useful  technique  now  known 
as  generic  ultrapowers.  Owing  to  various  developments  in  the  1970’s  generic 
ultrapowers  was  to  emerge  as  a standard  technique  of  wide  applicability,  and  it  is 
taken  up  against  a broader  backdrop  in  volume  II. 

What  played  a key  role  in  this  area  and  soon  became  a staple  feature  of  large 
cardinal  theory  is  the  concept  of  saturated  ideal,  formulated  and  studied  by  Tarski 
[45],  Building  on  the  §0  preliminaries  about  ideals,  for  I an  ideal  over  a set  S, 
set 

/*  = {X  c s I S - X e 1}  , and 
/+  = V{S)  - I . 

I*  is  the  dual  filter  of  /,  and  I+  the  collection  of  /"“-stationary  sets,  i.e.  those  sets 
X c S such  that  X (T  Z 0 for  any  Z e I*.  A e I+  iff 

I\A  = {X  c S I in  A e 1} 

is  a (proper)  ideal,  the  restriction  of  I to  A or  the  ideal  generated  by  I U [S  — A}. 
Intuitively,  I,  I+,  and  I*  are  the  collections  of  negligible,  non-negligible,  and  all 
but  negligible  subsets  of  S with  respect  to  I . For  S a regular  cardinal  k > a>, 
the  main  example  is  I = NS*,  the  ideal  of  non-stationary  subsets,  when  I+  is  the 
collection  of  stationary  subsets  and  I*  the  closed  unbounded  filter  CK.  Of  course, 
the  theory  of  ideals  can  be  recast  entirely  in  terms  of  filters,  an  approach  consistent 
with  the  modem  preference  for  ultrafilters  over  maximal  ideals.  However,  like  the 
choice  of  p < q in  forcing  for  p is  stronger  than  q,  the  ideal  formulation  is  more 
natural  because  of  the  interplay  with  Boolean  algebras. 
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Turning  to  Tarski's  concept,  for  an  ideal  7, 

7 is  X-saturated  iff  for  any  {Xa  \ a < 1}  C /+  there  are 
/3  < y < X such  that  Xp  n Xy  e I+  , 

and 

sat(7)  = the  least  X such  that  7 is  ^-saturated  . 

sat(7)  is  a measure  of  how  close  7 is  to  being  a maximal  ideal,  with  sat(7)  = 2 
iff  I is  maximal.  Erdos-Tarski  [43]  established  that  sat(7),  if  infinite,  is  always  a 
regular  uncountable  cardinal. 

Only  ideals  7 over  cardinals  k will  be  considered  in  this  section,  for  which 
by  a §0  convention  a — {£  | £ < a}  e I for  every  a < k.  Furthermore,  for 
the  study  of  large  cardinal  properties  analogous  to  measurability  only  ideals  of 
requisite  completeness  will  be  considered: 

By  ideal  over  k is  meant  a: -complete  ideal  over  k . 

It  is  simply  to  see  that  for  such  ideals  to  exist  k must  be  regular. 

Saturation  of  ideals  is  a rather  general  concept;  for  example,  the  ideal  [/r]<'c 
over  k is  not  k-saturated  iff  there  is  a family  F oiX  almost  disjoint  subsets  of  k: 
F C \jcf  with  |F|  = X such  that  X f Y both  in  F implies  that  |X  fl  Y\  < k.  An 
early  result  of  combinatorial  set  theory  is  that  such  a family  exists  for  X = k+  by 
a simple  recursive  construction  (Sierpinski  [28]),  so  that  [k-]<,c  is  not  k+- saturated. 
The  connection  with  real-valued  measurability  is  provided  by  2.1: 

16.2  Exercise.  If  m is  a K-additive  measure  over  k,  then  [A  C k \ m (X)  = 0}  is 

an  coi-saturated  ideal  over  k.  H 

Solovay  showed  that  only  this  property  is  all  that  is  needed  to  establish  the 
large  cardinal  consequences  of  real-valued  measurability.  Beyond  this,  the  study 
of  saturated  ideals  led  to  a basic  result  about  stationary  sets  and  soon  developed 
into  an  important  area  of  research  in  its  own  right.  This  section  establishes  the 
basic  combinatorial  results  about  saturated  ideals  and  the  next,  the  first  relative 
consistency  results. 

Initial  observations  about  the  saturation  of  ideals  were  based  on  Ulam’s  fun- 
damental work  on  measurability. 

16.3  Exercise  (Ulam  [30]).  For  any  X,  there  is  no  X+ -saturated  ideal  over  A+. 

Hint.  Check  that  the  proof  of  2.4  with  Ulam  matrices  works.  H 

The  following  proposition  is  established  using  transfinite  versions  of  Ulam’s 
tree  argument  for  2.5(a);  its  (c)  affirms  7.12  by  16.2. 

16.4  Proposition.  Suppose  that  I is  an  ideal  over  k. 

(a)  (Tarski  [45:  53])  If  I is  X-saturated  where  2<x  < k,  then  k is  measurable. 
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(b)  (Levy,  Silver)  If  I is  K-saturated  and  k is  weakly  compact,  then  k is 
measurable. 

(c)  If  I is  X-saturated  for  some  X < k,  then  k has  the  tree  property. 

Proof.  For  (a)  or  (b),  suppose  first  that  I has  an  atom,  i.e.  a set  A e I+  such 
that  whenever  A = B U C is  a disjoint  union,  either  Bel  or  C e I . Then 
{I  c jc  | X n A e /+)  is  a k -complete  ultrafilter,  and  hence  k is  a measurable 
cardinal.  So,  we  can  assume  to  the  contrary  that  I has  no  atoms,  and  derive  a 
contradiction. 

For  this  purpose,  build  a tree  T ordered  by  D and  consisting  of  sets  indexed 
by  some  of  the  members  of  <K2  as  follows:  Set  Xf,  — k.  If  Xs  has  been  defined 
for  some  s e <K2,  then  define  Xs~(0)  and  ^-(l)  exactly  when  Xs  e I+,  in  which 
case  they  are  to  satisfy 

Xs~( o>,  Afj-(i)  e I+  , Xs  — Xs~(o)  U Xs~<i>  , and  Xj~(o)  Li  = 0 . 

That  I has  no  atoms  makes  this  possible.  Finally,  if  S < k is  a limit  and  s e s2, 
define  Xs  = P|a<i5  Xs\a  exactly  when  Xs\a  has  been  defined  for  each  a < S.  The 
tree  T thus  built  has  the  following  property: 

(*)  If  y < k and  s e Y2,  then  the  collection  of  “offshoots” 

{-^C?|ck~(/)  | a < y A Xs|„+i  is  defined  A s(or)  f /'}  is  pairwise  disjoint . 

For  (a),  (*)  and  A-saturation  imply  that  T has  height  at  most  X.  But  then,  k 
is  a union  of  2<x  < k sets  in  /,  namely  those  X s ’ s without  tree  successors.  This 
contradicts  the  k -completeness  of  I . 

For  (b),  note  that  if  T had  height  less  than  k,  then  by  the  inaccessibility  of  k 
and  the  above  argument,  the  k -completeness  of  I would  again  be  violated.  Thus, 
it  can  be  assumed  that  T is  a at -tree,  and  so  by  weak  compactness  it  has  a cofinal 
branch.  But  by  (*)  this  contradicts  the  /e-saturation  of  I . 

To  establish  (c),  suppose  that  (T,  <T)  is  a at -tree;  it  can  be  assumed  that 
T = k.  For  £ < k set  S$  — {£  < k \ % <t  K j,  and  let  U — < k \ e I+}. 
Then  % e U and  f <T  § implies  that  f e U.  It  is  simple  to  see  that  the  subtree 
determined  by  U has  height  k because  of  the  k -completeness  of  /,  and  that  each 
of  its  levels  has  cardinality  less  than  X by  k-saturation.  7.9  then  implies  that  this 
subtree  has  a cofinal  branch,  which  is  also  a cofinal  branch  through  {k,  <-/-).  H 

The  hypothesis  of  (c)  cannot  be  weakened  to  I is  /(--saturated  (Kunen  [78]). 
It  follows  from  (a)  that  the  theory  of  A.- saturation  over  k essentially  reduces  to 
that  of  measurability  unless  X > a>  and  2<x  > K.  Having  saturated  ideals  over  k 
does  not  require  the  strong  inaccessibility  of  k as  evidenced  by  16.2,  and  subject 
to  some  restrictions  like  16.3  there  could  be  saturated  ideals  low  in  the  cumulative 
hierarchy. 

Since  an  ideal  I over  k is  trivially  (2^)+ -saturated,  the  substantive  cases  for 
A-saturation  are  for  X in  the  range  a>  < X < 2K . In  fact,  the  theory  quickly  de- 
volved into  three  cases:  k+ , k,  and  </c  saturation.  In  what  follows,  Solovay’s 
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results  are  analyzed  by  progressively  considering  these  cases  in  turn,  deriving  con- 
sequences of  measurability  which  are  really  consequences  of  these  restricted  hy- 
potheses. Although  restricted  as  combinatorial  properties,  it  should  be  mentioned 
at  the  outset  that  even  k+ -saturation  has  the  consistency  strength  of  measurability 
(see  remarks  at  the  beginning  of  §17).  ^.-saturation  for  k+  < X < 2K  has  been 
studied  in  Baumgartner  [76:  §5]  and  more  extensively  in  Baumgartner-Taylor  [82]; 
interesting  questions  remain,  but  large  cardinals  are  generally  not  involved. 

While  /e-saturation  amounts  to  the  assertion  that  no  subset  of  k can  be  par- 
titioned into  k disjoint  I+  sets,  k+ -saturation  has  no  such  intuitive  feel.  What  is 
significant  is  the  completeness  of  the  quotient  Boolean  algebra  V(k)/I.  This  is 
the  Boolean  algebra  consisting  of  equivalence  classes  [X\i  for  X c k,  where 

[X],  = [r]j  iff  XAYel. 

As  usual,  the  subscripts  are  suppressed  when  clear  from  the  context.  The  Boolean 
operations  are  the  set-theoretic  ones  modulated  by  ideal  properties: 

[X]  + [F]  = [X  U Y]  , [X]  ■ [F]  = [X  n F]  , and  - [A]  = [k  — X]  . 

Consequently, 

[X]<[Y]  iff  X-Yel  , 
and 

0 = [0]  = / and  1 = [k]  = I* 

are  the  zero  and  one  elements  of  the  Boolean  algebra.  (/+,  c)  is  later  considered 
as  a notion  of  forcing  (see  Precipitous  Ideals  in  volume  II).  Since  then  p A q iff 
p n q e /,  the  separative  quotient  /+/»  is  just  V{k)/I  — [0[.  In  this  guise  the 
X-saturation  of  I is  equivalent  to  the  X-c.c.  of  the  forcing  notion. 

The  following  was  the  first  significant  result  about  k+ -saturation.  Recall  that 
a Boolean  algebra  B is  complete  iff  for  any  S c If  the  least  upper  bound  J2  S 
exists  in  the  sense  of  B. 

16.5  Proposition  (Smith-Tarski  [57:254]).  If  I is  a k+ -saturated  ideal  over  k, 
then  V(k)/I  is  a complete  Boolean  algebra. 

Proof.  Set  B — V{k)/I  and  let  S c B be  arbitrary.  The  proof  that  S exists 
proceeds  progressively  through  three  cases  depending  on  |S|: 

Case  (i):  |S|  < k.  With  S = {[A„]  | a < y}  where  y < k it  follows  that 
T.S  = [U„<Kzo']:  If  [T]  > [A„]  for  every  a < y,  then  (U„<y  x<*)  ~ Y = 
U a<y(xa  -Y)e  I by  /(-completeness,  and  so  [F]  > [IJ«<K  Pl- 
ease (ii):  |S|  = k.  Let  S — {ba  \ a < /c],  and  using  (i)  set  aa  — ba  — bt= 
for  a < k.  Then  straightforward  arguments  show  that: 

(a)  if  a < f < k,  then  aa  ■ ap  — 0 ; 

(b)  if  a < k,  then  ^<a  a*  — h i ancJ  hence 

(c)  if  Y,a<K  a«  exists>  then  S0  doeS  ba  and  T,a<K  a“  = T.a<K  b«  ■ 
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We  can  now  work  with  the  af s instead,  which  can  be  assumed  to  be  nonzero. 
Extend  {aa  | a < k]  to  a maximal  family  A C B such  that  x f y both  in  A implies 
that  x / 0 yet  x-y  = 0.  By  k+ -saturation  | A\  < k,  so  let  A = {[F5]  | S < at}.  The 
Y$’s  can  be  taken  to  be  pairwise  disjoint.  (Replace  each  Yg  by  Yg  — U^<i  ^ = 
Yg  — n F$),  noting  that  the  subtracted  set  is  in  / by  k -completeness.) 

Finally,  set  D — (J{ | [F<j]  = aa  for  some  a < k}\  then  [Zl]  = fZa<K 

Suppose  that  [Zs]  > aa  for  every  a < k.  To  show  that  [E]  > [D],  set 
b = [D]  — [£].  Certainly  b ■ aa  = 0 for  every  a < k by  hypothesis  on  [£]. 
Also,  for  [Ti]  not  an  aa,  Yg  D D — 0,  and  so  b ■ [Fa]  = 0.  Hence,  b = 0 by  the 
maximality  of  A. 

Case  (iii):  .S'  > k.  It  can  be  assumed  inductively  that  if  T C S with 
\T\  < |5|,  then  T exists.  Let  S — {ba  \ a < A},  and  using  the  inductive 
hypothesis,  set  aa  = ba  — bt=  for  a < X.  Then  (a)-(c)  of  (ii)  hold  with  k 
replaced  by  X.  But  by  k + -saturation  at  most  k of  the  «„’s  are  non-zero,  and  so 
by  (ii)  aa  = ba  exists.  H 

This  result  has  a partial  converse: 

16.6  Exercise  (Solovay  [71:427]).  Suppose  that  I is  an  ideal  over  k such  that 
V(k)/I  is  complete.  Then  for  any  X such  that  I is  not  X-saturated,  2X  < 2K . In 
particular: 

(a)  1 is  2K  -saturated. 

(b)  If  2K  < 2K  , then  I is  k+ -saturated. 

Hint.  Suppose  that  {ba  \ a < X}  C.  V{k)/I  is  a counterexample  to  k-saturation. 
For  X c X set  ax  — and  note  that  if  X f Y,  then  ax  / aY.  But 

\V(k)/I\  < 2K.  H 

The  completeness  of  V(k)/I  is  itself  a substantial  hypothesis;  see 
Kanamori-Shelah  [95], 

Solovay  used  16.5  to  show  that  /c+-saturation  is  enough  to  “normalize”  ideals 
in  analogy  to  5.11  for  measurability.  To  establish  some  further  terminology,  for 
I an  ideal  over  k,  X c k,  and  /:  X — > k, 

f is  I-small  on  X iff  {£  e I | /(§)  — a}  e I for  every  a < k , 
f is  1-incompressible  on  X iff  f is  /-small  on  X yet  for  any  Y C.  X 

with  Y e I+  and  g:  Y ->  k with  g(f)  < f(f  ) 
for  | e F,  g is  not  /-small  on  F , and 
I is  normal  iff  I*  is  a normal  filter  , 

i.e.  for  any  X e I+  and  / regressive  on  X,  there  is  a F c X with  Y <=  I+  such 
that  / is  constant  on  F. 

16.7  Proposition  (Solovay  [71]).  Suppose  that  I is  a k+ -saturated  ideal  over  k. 
Then: 


16.  Saturated  Ideals  I 


215 


(a)  For  any  X e I+  there  is  an  I -incompressible  function  on  some  Y C X 
with  Y e /+. 

(b)  There  is  an  I -incompressible  function  on  k. 

(c)  If  f is  an  I -incompressible  function  on  k,  then 

/*(/)  = [XcK  | f~\x)el) 

is  a normal  k+ -saturated  ideal  over  k. 

Proof,  (a)  Assume  to  the  contrary  that  this  fails  for  some  X e I+.  The  following 
is  first  established: 

(*)  For  any  f /-small  on  X,  there  is  an  / /-small  on  X 
such  that  m eX  \ 7(f)  < /($)}]  = [X]  . 

Given  such  an  /,  note  that  by  assumption  whenever  Y c X with  Y e I+,  there 
is  a Z c F with  Z e I+  and  a g /-small  on  Z such  that  g(f)  < /(f)  for  f e Z. 
Flence,  by  taking  an  appropriately  maximal  family  C V(X)  f!  /+  and  invoking 
Ar+-saturation,  there  is  a collection  {(Z„,  ga ) \ a < /c}  satisfying: 

(i)  a < / < k implies  that  Za  c X with  Za  e I+  yet  Za  D Zp  e I ; 

(ii)  ga  is  /-small  on  Za  and  ga(f ) < /(f)  for  f e Z„  ; and 

(iii)  E„<,[Z„]  = m ■ 

As  for  16.5,  the  Z„’s  can  be  taken  to  be  pairwise  disjoint  (by  replacing  Za  with 
Za  ~ L4<a  Z|).  Now  let  / be  any  extension  of  ga  to  all  of  X.  Then  it 
is  simple  to  see  that  / satisfies  the  conclusion  of  (*).  For  example,  / must  be 
/-small  on  X,  else  if  {f  e X \ /(f)  = y]  e I+  for  some  y < k,  then  by  (iii) 
(f  e Z„  | gad;)  = /(f)  = y}  e I+  for  some  a < k,  contradicting  (ii). 

With  (*)  we  can  readily  derive  a contradiction  as  follows:  Let  fo  be  the 
identity  map  on  X,  and  for  new  inductively  set  fn+\  — fn  as  provided  by  (*). 
Setting 

X„  — d e X \ /,+1(f)  < /„(f)}  , 

[A„]  = [X]  implies  that  X — Xn  e I and  hence  X — p|n  X„  el,  so  that  f~)n  Xn  e 
I+.  But  any  f in  this  set  corresponds  to  a descending  sequence  of  ordinals 

/o(f)  > /i(f)  > /2(f)  > •••  . 

(b)  By  taking  an  appropriately  maximal  family  and  invoking  k+ -saturation, 
it  follows  from  (a)  that  there  is  a collection  {(F„,  fa)  I a < k}  satisfying 

(i)  a < / < k implies  that  Ya  £ I+  yet  Ya  n Yp  e I , 

(ii)  fa  is  /-incompressible  on  Ya  , and 

(iii)  E„<*[r«]  = 1 • 

Again  the  Fff’s  can  be  taken  to  be  pairwise  disjoint.  Then  any  extension  of 
Ua<K  fa  to  all  of  k is  an  /-incompressible  function  on  k. 

(c)  This  is  straightforward.  H 
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The  strengthening  to  k -saturation  opens  the  door  to  results  about  the  large 
size  of  k analogous  to  those  for  measurability.  The  proof  of  (c)  below  was  based 
on  the  original  Keisler-Tarski  one  of  5.14.  Recall  that 

M(X ) = {a  e X | X fl  a is  stationary  in  a]  , 

Mahlo’s  operation,  (e)  was  also  obtained  by  Jensen  and  in  part  by  Fremlin 
(cf.  Solovay  [71:397]). 


16.8  Proposition  (Solovay  [71]).  Suppose  that  1 is  a normal  K-saturated  ideal 
over  k.  Then: 

(a)  If  X e I+  and  f is  regressive  on  X,  then  there  is  a Y C X with  [T]  = [X] 
and  a y < k such  that  f“Y  C y . 

(b)  {a  < k | a is  regular]  e I*. 

(c)  If  S c k is  stationary,  then  [a  < k \ S fl  a is  stationary  in  a } e I*. 

(d)  For  any  X C k,  [M(X)]  = [X], 

(e)  k is  k -weakly  Mahlo. 

Proof,  (a)  Set  Xa  = {£  e X | /(£)  = a]  for  a < k,  and  E — {a  < k \ Xa  e 
I+}.  By  /r -saturation  \E\  < k,  and  so  y — suplli)  < k.  Hence,  we  can  take 
Y — Uo;e£  since  X — Y el  by  a simple  application  of  normality. 

(b)  Assume  to  the  contrary  that  {a  < k | cf(a)  < a}  e I+,  so  that  by 
normality  A — [a  < k | cf(a)  — 8}  e I+  for  some  8 < k.  For  each  a e A,  let 
(??"  | £ < 8)  be  cofinal  in  a,  and  then  for  each  f < 8 define  regressive  functions 
gi=  on  A by:  gt=(a)  = ))“.  By  (a),  for  each  § < 8 there  is  an  A|  C A with 
[A^]  = [A]  and  a y$  < k such  that  g|“A|  c yt.  Set  y = sup([y|  | f < <5})  < k\ 
By  a: -completeness  A|  e I+  as  in  previous  arguments  yet  P|?<5  A^  c y , 
which  is  a contradiction. 

(c)  Assume  to  the  contrary  that  W = [a!<K'|Snaris  not  stationary  in 
a } e I+.  Then  for  each  a e W there  is  a Ca  closed  unbounded  in  a such  that 
Ca  fl  S = 0.  Define  ff.  W ->  k for  § < k by: 


/?(«)  = 


£ th  element  of  Ca  if  there  is  one  , and 
0 otherwise . 


Then  is  regressive  on  W,  so  by  (a)  there  is  a W|  c W with  [JF^]  = [W]  and 
a yt=  < a:  such  that  ff‘W%  c y^. 

Let  D be  the  diagonal  intersection  of  the  JF^'s,  i.e. 


D — {a  e W \ a e . 

Then  [D]  = [IF]:  If  not,  {a  e W \ 3h(a)  < a(a  £ VF/,(a,))}  e I+,  and  so 
by  normality  {a  e W \ a £ } e I+  for  some  § < v;  but  this  contradicts 

[ ] = [IF].  Next  set  C = [a  < k \ a is  a limit  A < a(y$  < a)},  a closed 
unbounded  set. 
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To  conclude  the  argument,  since  S is  stationary,  let  ;/  e S IT  C,  and  £ e D 
with  £ > 77.  By  (b)  we  can  assume  that  f is  regular,  so  that  MO  is  the  77th 
element  of  Cf.  In  fact,  MO  = 77  since 

n < MO  = sup({/f(f)  I £ < 77})  < sup([yf  I § < 77})  < r]  . 

The  equality  holds  since  r / is  a limit  ordinal,  the  next  inequality  since  f e D,  and 
the  last  since  rj  e C.  Thus,  r]  = MO  e Cf  yet  ?;  e .S',  which  is  a contradiction. 

(d)  We  can  assume  that  X e I+;  X is  then  stationary  since  I*  is  normal  and 
thus  contains  every  closed  unbounded  subset.  As 

M(X)  = A fl  [a  < /c  | A fl  a is  stationary  in  a]  , 
the  result  now  follows  from  (c). 

(e)  This  follows  by  induction  from  (b),(d),  and  /e-completeness.  H 

As  with  weakly  compact  cardinals  (e)  can  be  extended  using  diagonal  inter- 
sections: {a  < k | a is  a-weakly  Mahlo}  e I*  and  so  forth. 

Solovay  used  16.8(d)  to  establish  a basic  result  about  stationary  sets  that  does 
not  involve  large  cardinal  hypotheses: 

16.9  Theorem  (Solovay  [71]).  Suppose  that  k > co  is  regular  and  X C k is 
stationary.  Then  X can  be  partitioned  into  k disjoint  stationary  sets,  i.e.  there  are 
pairwise  disjoint  stationary  sets  Xj,  for  f < k such  that  X — [J|<K  X^. 

Proof.  The  crux  of  the  matter  is: 

(*)  If  S c k is  stationary  and  k — S is  unbounded  in  k,  then  S — M(S)  = 

[a  e S | S fl  a is  not  stationary  in  a}  is  stationary  in  k . 

To  show  this,  suppose  that  C closed  unbounded  in  k.  Let  D be  the  set  of 
limit  points  of  C less  than  k,  and  E the  set  of  limit  points  of  k — S less  than 
k.  Then  D and  E are  both  closed  unbounded  in  k,  so  S fl  I)  fl  E f 0.  Let  a 
be  the  least  member  of  S fl  D fl  E.  If  cf(a)  = co,  then  since  a e E,  there  is  an 
unbounded  subset  of  a of  ordertype  co  disjoint  from  S fl  a.  If  cf(a)  > co,  then  it 
is  simple  to  see  that  D fl  E (T  a is  a closed  unbounded  subset  of  a disjoint  from 
S fl  a.  In  either  case,  a e (S  — M(S ))  fl  C. 

To  establish  the  theorem,  assume  by  taking  a subset  of  X if  necessary  that 
k — X is  unbounded  in  k.  If  A cannot  be  partitioned  into  k disjoint  stationary 
subsets,  then  it  is  simple  to  check  that  NS^|A,  the  ideal  of  non-stationary  subsets 
of  k restricted  to  X,  would  be  a normal  v-satu rated  ideal.  But  then,  (*)  for  .S'  = A 
would  contradict  16.8(d).  H 

This  result  has  become  a standard  tool  of  combinatorial  set  theory,  and  proofs 
have  been  devised  that  avoid  16.8(d)  (see  e.g.  Jech  [03:95]).  Nonetheless,  it  is 
a prominent  example  of  a basic  result  of  set  theory  first  found  in  the  context  of 
large  cardinals.  Another  is  Silver’s  1974  result  on  the  Singular  Cardinals  Problem 
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(see  volume  II).  These  results  further  attest  to  the  “pragmatic  success”  of  large 
cardinals;  techniques  developed  in  their  study  have  led  to  crucial  results  in  set 
theory  not  involving  large  cardinal  hypotheses. 

A basic  combinatorial  question  about  the  ideal  NS*  of  non-stationary  subsets 
of  k sets  was  soon  raised: 

Can  NS*-,  or  even  NSK|S  for  some  stationary 

SCk,  ever  be  K+-saturated  for  some  regular  k > col 

16.9  had  denied  k -saturation,  but  the  question  of  k+ -saturation,  particularly  for 
the  focal  case  k — u>\,  could  not  be  readily  resolved.  Results  soon  showed  that 
having  any  /c+-saturated  ideal  over  a successor  cardinal  k was  a very  substantial 
hypothesis,  and  the  possibility  at  co\  was  particularly  enticing.  Progressive  research 
steadily  clarified  the  relationship  between  ideals  and  large  cardinal  hypotheses,  and 
eventually  led  in  the  mid-1980’s  to  a positive  answer  to  the  question  (see  §32  and 
volume  II). 

What  about  partition  properties?  Using  <k  saturation,  Solovay  established 
a generalization  of  Rowbottom’s  result  7.17  on  normal  ultrafilters.  For  a filter  F 
over  a cardinal  k and  w < v < k, 

F is  v-Rowbottom  iff  for  any  /:  [at] <a'  ->  y with  y < k there 
is  an  X e F such  that  \f“[X]<0>\  < v . 

F is  Rowbottom  iff  F is  a>\ -Rowbottom  . 

This  filter  amplification  of  v-Rowbottom  cardinals  is  analogous  to  the  relationship 
of  normal  ultrafilters  to  Ramsey  cardinals.  A simple  observation  is  made  before 
turning  to  the  theorem: 

16.10  Exercise.  Suppose  that  I is  a v-saturated  ideal  over  k where  v < k.  If 

f\K—*y  where  y < k and  E = {£  < y | /-1({£})  e /+},  then  E 0,  \E\  < v, 
and  1)  e !*■  H 

16.11  Theorem  (Solovay  [71]).  Suppose  that  I is  a normal  v-saturated  ideal  over 
k where  v is  regular  and  a>  < v < k.  Then  I*  is  a v-Rowbottom  filter. 

Proof.  It  suffices  to  establish  the  following  for  every  n e co:  for  any  /:  [at]"  ->  y 
with  y < k there  is  an  X e I*  such  that  |/“[X]"|  < v. 

Proceeding  by  induction,  the  case  n = 1 follows  from  16.10.  So,  assume  that 
the  statement  holds  for  n,  and  suppose  that  /:  [/c]"+1  — > y with  y < k.  Define 
fa-  M"  -»  y for  each  a < k as  follows: 

{/({a}  Us)  if  a < mints)  , and 

0 otherwise . 

By  the  inductive  hypothesis,  for  each  a < k let  Ha  e I*  be  such  that  fa“[Ha]”  = 
Sa  with  | Sa|  < v. 

We  now  whittle  down  further.  First,  let  g:  k -»  v be  given  by  g(a)  = |Sa|. 
By  16.10,  let  Wei*  such  that  \g“W\  < v.  If  S — sup(g“W),  then  S < v by  the 
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regularity  of  v.  For  each  a e W,  let  ha:  8 — »■  Sa  be  a surjection.  Then  define 
gf.  W — > y for  each  f < 8 by:  g* (a ) = ha(f).  Again,  for  each  £ < 8 there 
is  a W|  c W with  Wj  e I*  such  that  < v.  By  the  regularity  of  v,  if 

E = Uf<«  8i“W. f,  then  \E\  < v. 

Finally,  set  X = P|^<4  n A a<KHa.  X e I*  by  normality,  so  to  complete 
the  proof  it  suffices  to  show  that  /“[X]"+1  c E.  Suppose  then  that  t e [Z]n+1, 
written  t — {a}  U s where  a < min  A).  Then  f(t)  — fa(s)  e Sa  as  s e [Ha]n, 
and  as  a e it  follows  that  Sa  c E.  H 

16.12  Corollary  (Prikry  [75]). 

(a)  Suppose  that  there  is  a v-saturated  ideal  over  k,  where  a>  < v < k and 
2<v  — k.  Then  for  v < X < k,  2X  = k. 

(b)  If  2K°  is  real-valued  measurable,  then  2f  — 2^°  for  eveiy  X < 2So. 

Proof,  (a)  If  v is  regular,  the  result  follows  from  16.11  and  8.9.  If  v is  singular, 
then  2V  — k (see  the  first  paragraph  of  the  proof  of  8.9);  but  v < k implies  v+  < k 
by  16.3,  and  so  just  v+-saturation  implies  the  result  by  16.11  and  8.9.  (There  is 
another  argument  appealing  to  the  Erdos-Tarski  [43]  result  that  the  least  X such 
that  some  ideal  is  ^.-saturated  is  always  regular.) 

(b)  This  follows  from  (a).  H 

(b)  was  Prikry’s  main  objective;  his  argument  was  factored  through  8.9.  The 
conclusion  of  (b)  is  also  a consequence  of  Martin’s  Axiom,  a hypothesis  known 
to  be  inconsistent  with  the  real-valued  measurability  of  2K°. 

The  analogy  with  measurable  cardinals  was  further  extended  by  Kunen  (see 
Solovay  [71]),  who  established  that  a cardinal  k carrying  a v-saturated  ideal  for 
some  v < k has  a kind  of  77j2-indescribability  property. 
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17.  Saturated  Ideals  II 

This  section  is  devoted  to  the  first  relative  consistency  results  about  saturated  ideals 
obtained  by  forcing.  Solovay  was  led  to  16.11  so  that  he  could  establish  16.1(b) 
by  adapting  an  argument  of  Silver’s  from  measurability  - see  20.3  and  20.4. 
Soon  afterwards  Kunen  [70]  using  iterated  ultrapowers  showed  that  k+ -saturation 
sufficed  for  a comprehensive  result:  If  I is  a normal  k+ -saturated  ideal  over 
k,  then  k is  measurable  in  L\I],  the  class  of  sets  relatively  constructible  from  I . 
Moreover,  in  L[I\  the  dual  filter  to  I i T L[I ] is  a normal  ultrafilter  over  k.  A 
refined  version  of  this  result  will  be  established  later  (see  Precipitous  Ideals  in 
volume  II).  What  is  pertinent  for  us  now  is  that  the  forcing  arguments  below  all 
assume  the  existence  of  a measurable  cardinal  in  the  ground  model,  an  assumption 
both  necessary  as  well  as  sufficient. 

An  early  forcing  result  of  Prikry  spurred  the  initial  work  in  this  area.  He 
showed  that  merely  chain  conditions  suffice  to  preserve  saturation,  and  thus  satu- 
rated ideals  are  resilient  under  a variety  of  generic  extensions.  The  following  is 
an  extension  of  his  result. 

17.1  Theorem  (Prikry  [66,70]  for  X < k,  Solovay  [71:411]  for  7.  = k,  and  Kakuda 
[72]  for  X — k+).  Suppose  that  I is  a X-saturated  ideal  over  k where  X < k+  is 
regular,  and  P is  a p.o.  with  the  v-c.c.  where  v < k and  v < X.  Then 

|| -PI  generates  a k-saturated  ideal  over  k . 

Proof.  Let  ||-  7 = [X  c k | 37  e I(X  c T)},  the  ideal  generated  by  I.  Only 
v < k is  needed  to  show  that  ||-  I is  /r-complete:  It  suffices  to  show  that  if  p < k 
and  p ||-  {Xa  | a < p]  c 7,  then  there  is  an  A e I such  that  p ||-  |Ja  Xa  c A. 

To  do  this,  for  each  a < p let  Qa  be  a maximal  antichain  below  p such  that  for 

each  q e Qa  there  is  an  A„  e I such  that  q || - Xa  = Aqa.  Since  \Qa\  < v < k by 
the  v-c.c.,  Aa  = |J {A%  \ q e Qa } e I.  Thus  also  A — LX<p  Aa  e 7,  and  it  is 
simple  to  see  that  p ||-  IJo,^  ^ A. 

Turning  to  k-saturation,  suppose  that  p |f-  {Ya  \ a < A]  c /+.  The  proof  is 
completed  by  finding  a q < p and  a < f < X so  that  q ||-  Ya  fl  Yp  e /+: 

For  each  a < X let 

= {£  < k I 3q  < p(q  II-  £ e Ya)} . 

Then  Aa  e I+  since  p ||-  Ya  c Aa.  Hence,  there  must  be  an  A e I+  satisfying: 

(*)  VW(W  c A A W e 7+  ->  |{«  < X \ W 0 Aa  e 7+}|  = X)  . 

This  follows  directly  from  k-saturation  and  the  regularity  of  X:  Let  {Wy  \ y < 
p } c 7+  be  maximal  such  that  y < S < p implies  that  Wy  fl  Ws  e 7 and 
| [a  < X | WY  fl  Aa  e 7+}|  < X.  Then  p < X,  and  so  it  is  readily  seen  that  A — Ap 
must  satisfy  (*)  for  any  sufficiently  large  fi  < X. 
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With  (*),  sets  {Stl  \ rj  < v}  C.  [L]<A-  and  {Ba  \ a e Sq}  can  be  defined 
recursively  so  that  for  ;/  < f < v, 

(i)  s„  n S(  = 0 , 

(ii)  if  [a,  0}  e [5"^]2,  then  Ba  c Aa  n A and  Ba  e 7+ 
yet  Bu  fl  Bp  — 0 , and 

(iii)  [A]  = Zaes„lBa]  ■ 

At  stage  i]  < v of  the  recursion,  | [J^  Sf|  < X since  v < X and  X is  regular,  and 
so  with  (*)  find  Stl  c X — |Jj  S%  and  {Ba  \ a e Stl}  satisfying  (iii)  such  that 
for  {a,  /J}  e [S',,]2,  Ba  c Aa  fl  A and  Ba  e 7+  yet  Ba  n Bp  e I.  |5,|  < X by 
1-saturation,  and  so  the  Ba’ s can  be  taken  to  be  pairwise  disjoint  by  replacing  Ba 
with  Ba  — IJfB/j  | P < a A fi  € Stl}  as  usual. 

Let  A'  = n„<v  Uaes  So,  so  that  by  (iii)  and  /(--completeness,  [A']  = [A] 
and  thus  A'  e I+.  By  (ii),  for  each  ( e A'  and  17  < v there  is  exactly  one  a e S,, 
such  that  £ e Ba  c A„.  By  definition  of  Att  let  qrJ  < p such  that  q'J  ||-  f € Ya. 
Note  that  by  the  v-c.c.,  for  each  (eA'  there  must  be  < v and  an  r|  e P 

such  that  . (Note  that  if  X < k,  we  could  have  taken  the  F„’s  to  be 

forced  disjoint  and  finish  the  argument  here  with  a contradiction,  since  r $ forces 
f to  be  in  two  different  Ya’ s.)  By  /(-completeness  and  v < k,  there  must  be  fixed 
r]  < t;  < v and  a B c.  A'  with  B e I+  such  that  for  § e B,  (rj^,  fj)  = (q,  f). 
Finally,  by  (i),  (iii),  and  the  completeness  of  V(k)/I  there  are  a < fi  such  that 
heS,  and  fi  e and  Z — Ba  H Bp  D B e I+. 

All  of  the  foregoing  was  to  establish  that  there  are  Z e 7+  and  a < fi  < X 
satisfying: 

for  each  (eZ,  there  is  an  r j < p such  that  rj  ||-  £ e Ya  fl  Yp  . 

The  rest  is  simple.  Let  Q c P be  a maximal  antichain  below  p such  that  for 
each  q e Q there  is  an  Zq  e I such  that  q ||-  Ya  fl  Yp  c Zq.  Since  \Q\  < v < k, 
Z — U qeQ  Zq  e 7+.  For  any  § in  this  set,  note  that  must  be  incompatible  with 
every  member  of  Q.  Thus,  ||-  Ya  fl  Yp  e 7+  to  complete  the  proof.  H 

This  result  does  not  extend  to  the  k-c.c.  since  the  Levy  collapse  Col(&>,  k)  of 
an  uncountable  regular  cardinal  k to  co\  has  the  k-c.c.  yet  there  are  no  «i-saturated 
ideals  over  a>\  by  16.3.  For  /c+-saturated  ideals  over  k,  Laver  [82]  and  later 
Baumgartner-Taylor  [82a]  characterized  those  k-c.c.  P for  which  the  conclusion 
holds.  17.1  is  subsumed  by,  and  its  proof  derived  from,  this  characterization.  The 
case  X < k led  to  the  first  consistency  result  on  saturated  ideals: 

17.2  Corollary  (Prikry  [66,70]).  Suppose  that  k is  a measurable  cardinal.  Then 
there  is  an  coi-c.c.  p.o.  P such  that 


||- P 2*°  = K A 3/(7  is  an  a>\ -saturated  ideal  over  k)  . 
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Proof.  Let  P be  the  p.o.  for  adding  k Cohen  reals,  i.e. 

P — {p  | p is  a finite  function  A dom(p)  C k x co  a ran(/;)  C 2}  . 

It  is  well-known  that  P has  the  ®i-c.c.  (by  the  A-system  Lemma  10.4)  and  that 
|[p  2k°  = k (using  10.5  in  one  direction).  By  17.1  any  ideal  dual  to  a /c-complete 
ultrafilter  over  k generates  an  co\ -saturated  ideal  over  k in  the  generic  extension. 

H 


Prikry  [70]  verified  that  in  this  generic  extension,  k is  not  real-valued  mea- 
surable. Solovay  had  originally  been  inspired  by  Prikry’s  work  to  show  that  after 
“adding  k random  reals”  instead,  k does  remain  real-valued  measurable.  This  is 
the  result  established  next,  after  a discussion  of  the  forcing. 


Product  Measure  Forcing 

For  any  infinite  cardinal  k,  we  consider  a product  measure  for  K2  that  recalls 
Lebesgue  measure  as  formulated  in  §0.  For  each  function  s with  dom(s)  e [/c]<<M 
and  ran(s)  C 2,  set 

0(s)  = {/  e *2  | / 2 s]  . 

K2  is  topologized  by  taking  these  0(i)’s  as  the  basic  open  sets.  Specify  that 
Bk  is  the  a -algebra  on  K2  generated  by  the  GOO's. 


To  endow  BK  with  a measure,  let  mi  be  the  measure  on  V(2)  specified  by:  W2(0)  = 
0,  m2({0})  = ({ 1 })  = and  W2({0,  1})  = 1.  Then  stipulate  that 

mK  is  the  product  measure  on  BK  induced  by  ni2- 

mK(0(s ))  = 2-|s|  for  every  s,  and  mK  is  the  unique  measure  on  BK  with  this 
property. 

Consider 

B*k  = {Xe  Bk  | mK(X)  > 0} 


ordered  by 


P <q  iff  P^q 


as  a p.o.  for  forcing.  The  following  recalls  11.7. 


17.3  Exercise. 

(a)  If  p,q  e B*,  then  p X q iff  mK(p  (T  q)  = 0. 

(b)  B*  has  the  coi-c.c.  H 

B*  is  not  separative,  and  its  separative  quotient  B*/&  is  formulated  with 
p ^ q iff  mK(p  A q)  — 0.  Using  this  as  a definition  to  extend  » to  all  of  BK, 
Bk  / ~ is  a Boolean  algebra  with  the  Boolean  operations  being  the  set-theoretic 
ones  modulated  by  This  algebra  is  analogous  to  V(k)/I  discussed  previously, 
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with  the  corresponding  ideal  being  {X  e BK  \ mK(X)  — 0}.  By  a simple  version 
of  the  proof  of  16.5,  the  &>i-c.c.  implies  that  BK/&  is  a complete  Boolean  algebra. 

Proceeding  as  in  §11,  we  can  develop  codes  for  appropriate  B*  sets,  show 
that  forcing  with  B*  amounts  to  adding  a new  function:  k ->  2 analogous  to  a 
random  real,  and  provide  characterizations  like  11.9  and  11.10.  See  Kunen  [84] 
for  details.  The  next  proposition  shows  how  forcing  with  B*K  affects  the  size  of 
the  continuum. 

17.4  Proposition.  ||-g^2Ko  > k 1,  and  if  /cK°  = k,  then  ||-b^2Ko  = f . 

Proof.  For  each  limit  ordinal  y < k let  fy  be  a name  satisfying  ||-  fY:  a>  —>  2 
such  that  for  any  n e co  and  i <2, 

\Yfy{n)  = i ^ 0({(y  + n,  /)})  e G 

(where  G is  the  canonical  name  for  the  generic  object).  To  show  that  ||-  2K°  > k, 
it  suffices  to  show  that  \\-  fy  f f$  for  limit  ordinals  y <8  < k: 

Let  p e B*k  be  arbitrary.  Choose  k e co  so  large  that  2~k  < mK(p ).  For  each 
t e k2,  let 


t:  {y,  y + 1, ...,  y +k  - 1,5,5+  1,  ...,8  + k-  1}  2 

satisfy  l(y  + i)  — 1(8  + i)  — t(i)  for  each  i < k.  If  q = [J fgt2  0(1),  then 
mK(q ) = 2k(2~lk)  — 2~k . Thus,  mK(p  — q)  > 0,  and  a standard  forcing  argument 
shows  that  p — q ||-  fy\k  f f$\k. 

If  atk°  = k,  note  that  an  analysis  of  BK  in  a hierarchy  of  length  co\  as  for  the 
Borel  hierarchy  for  "w  shows  that  \BK\  = Kj  • /rK°  = k.  Flence  by  the  &>i-c.c.  and 
10.5,  |h  2K»  < k.  H 

For  any  y < k,  the  map  ey:  co  -*  k given  by  ey(n)  = y + n induces  an 
injective,  complete  embedding:  B*  ->  B*.  Flence  by  10.1(a),  if  G is  £>*-generic, 
then  efx  (G)  is  B*0 -generic.  Moreover,  for  limit  y these  generics  are  determined  by 
the  fY  of  the  previous  proof,  which  can  thus  be  considered  random  reals.  (In  §11, 
random  reals  were  defined  via  B*  c V^co)  whereas  B*0  c V(w2).  However,  there 
is  a natural  homeomorphism  between  “* co  and  ®2,  if  countably  many  points  are 
ignored,  that  preserves  measure  (see  e.g.  Levy  [79:  VII§3])  and  hence  randomness.) 
It  is  in  this  sense  that  forcing  with  B*  is  often  referred  to  as  “adding  k random 
reals”,  although  it  is  not  the  same  as  adding  k random  reals  “side-by-side”  with 
product  forcing  because  of  the  measure-theoretic  overlay. 

Solovay  established  his  result  16.1(a)  with  B*  using  Boolean-valued  models. 
Towards  a proof  of  his  result,  some  of  that  approach  is  adopted  in  the  assignment 
of  Boolean  values  in  a weak  sense  to  formulas  of  the  forcing  language  for  B*: 
For  any  formula  <p,  choose  a maximal  antichain  A c {p  e B*  \ p ||  (p).  A is 
countable  by  the  &>i-c.c.,  so  set 


M = U IP  e A | P \y<p]  e Bk 
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J<p]|  has  two  crucial  properties:  if  f 0,  then  |b  cp;  and  for  any  q e B*,  if 
q ||-  cp,  then  mK(q  — [[<p]])  = 0.  Although  |[<p]]  depends  on  the  choice  of  A,  only 
mK(^cp] |)  and  mK ([[<p]]  fl  p)  for  p e B*  will  be  used,  and  it  is  simple  to  check  that 
these  values  do  not  depend  on  A. 

The  following  result  establishes  16.1(a): 


17.5  Theorem  (Solovay  [66,71]).  Suppose  that  k is  measurable.  Then 

lb Bi  A k is  real-valued  measurable  . 

Proof.  By  17.4  it  suffices  to  establish  real-valued  measurability.  Let  U be  a k- 
complete  ultrafilter  over  k.  First  define  for  each  p e B*  a “measure”  \jlp  on  names 
X such  that  p |b  X C k by: 


pp(X)  = r 


iff 


{H  <K  I 


m,®exinp) 

r- ='■}£(/ 

mK(p) 


This  is  well-defined,  since  2K°  < k and  U is  /c-complete.  It  is  straightforward  to 
check  that 

(i)  If  X e U,  then  fip(X ) — 1;  if  X f U,  then  np(X)  = 0 . 

(ii)  If  p |b  X c Y C k,  then  pp(X ) < pp(Y)  . 

(iii)  If  y < k and  p |b  ^ X = IJIXq,  | a < y]  c k is  a 
pairwise  disjoint  union1,  then  pp(X)  — Yla<y  Pp(Xa)  ■ 

For  the  last,  note  that  for  any  § < k. 


»l(K  e X]  n p)  = mK{ UacyK  e n p)  = e n p) 

since  the  union  is  really  a countable  union  of  B*  sets  by  the  &>i-c.c.. 

A crucial  observation  is  that  for  any  p e B*,  name  X such  that  p |b  X c k, 
and  ret  with  0 < r < 1, 


(*)  if  Vq  < p3s  < q(pcs(X)  < r).  then  pp ( X ) < r 

(and  an  analogous  assertion  holds  with  the  last  two  <’s  replaced  by  >’s).  To  show 
this,  let  A — { p„  | n e co]  be  a maximal  antichain  below  p so  that  /ip„  (X)  < r for 
n e co.  By  the  u>\  -completeness  of  U,  there  is  an  E e U such  that  for  each  f e E, 
wk  ([[f  e I]  fl  pn)  < r ■ mK{pn)  for  every  n e co.  For  such  §,  the  maximality  of 
the  antichain  implies  that 

mK(K  e^]n  p)  = Y,nmK (K  e AJ  n pn)  < r ■ J2nmr(p„)  = r ' mK(p)  , 
and  hence  ptp{X)  < r. 

The  theorem  is  established  by  showing  that  for  any  £>*-generic  G, 
p\  V{k)  — >■  [0,  1]  given  by 


p(X)  = \m({pp(X)  | p e G}) 
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where  X is  any  name  for  X,  is  a /x-additive  measure  over  k in  V[G\.  To  verify  that 
the  limit  exists,  does  not  depend  on  the  name  X,  and  has  the  requisite  properties, 
Solovay  used  the  Radon-Nikodym  Theorem  of  measure  theory;  proceeding  more 
gingerly,  a lim  inf  is  taken  instead,  following  Jech  [03:  416ff): 

First  define  for  each  p e B*  a /x*  on  names  X such  that  p ||-  X c k by 

li;(X)  = \nf({,iq(X)  | q < p})  . 

Then  let  /x  be  a name  such  that  ||-  /x:  V(k)  — > [0,  1]  and  satisfying 

lb  X c K -»  /x(X)  = sup({/x;(X)  | p e G}) 

(where  G is  the  canonical  name  for  the  generic  object).  There  is  a basic  connection 
between  /x*  and  /x:  For  any  p e B*,  name  X such  that  p |[  X c k,  and  r e [0,  1], 

(**)  B*P(X)  > r iff  p |b  f(X)  > r . 

The  forward  direction  is  clear.  For  the  converse,  suppose  that  p |b  // ( X ) > r,  and 
let  /■  o e K satisfy  0 < ro  < r.  It  suffices  to  show  that  /x*(X)  > rg:  By  definition 
of  /x,  p |b  3s  e G(/x*(Z)  > ro).  This  means  that  any  q < p is  compatible  with 
some  s such  that  fi*(X)  > ro,  and  since  generally  speaking  p\  < pi  implies  that 
Mp,  (X)  > /x*2(Z),  it  follows  that 

V^r  < p3s  < q(p,*(X)  > r0)  . 

With  any  /70  < P playing  the  role  of  p here,  it  follows  from  the  >-variant  of  (*) 
that  /ipf  X)  > r0.  Hence,  p*p(X)  > r0. 

The  proof  is  completed  by  establishing  that  |b  /x  is  a well-defined  /(--additive 
measure.  By  (i)  above,  if  X e U,  then  |b  f{X)  — 1;  and  if  X f U,  then 
|b  i'l(X)  — 0.  Also,  if  p lb  X C Y C k,  then  for  any  q < p,  p*(X)  < p-*(Y) 
by  (ii)  above,  and  hence  p |b  (i(X)  < (Otherwise,  there  would  be  a 

q < p and  a rational  r such  that  q |b  f(X)  > r > jx(Y),  which  by  (**)  leads 
to  the  contradictory  p*{X)  > r > /x*(K).)  In  particular,  if  p |b  X = Y,  then 
p |b  jiiX)  = p(Y),  and  so  for  any  H*-generic  G,  /xG  is  well-defined  on  elements 
in  V[G].  The  rest  of  the  proof  is  devoted  to  showing  that  |b  /x  is  k -additive: 

We  first  verify  that  |b  /x  is  finitely  additive,  for  which  it  suffices  to  show  that 
if  |b  X,  Y C K are  disjoint,  then  |b  f{X  U Y)  = f(X)  + p(Y):  Suppose  that  r\ 
and  /*2  are  rationals  and  p |b  p(X)  > r\  A /x(F)  > ?2-  Then 

H*p(X  U Y)>  p*p{X)  + p*p{Y)  > n + r2  . 

Here,  the  first  inequality  follows  from  (iii)  above  and  the  definition  of  /x*,  and  the 
second,  from  (**).  So  again  by  (**),  p |b  f(X  U Y)  > r\  + r2-  Since  p,  r\,  and 
r2  were  arbitrary,  it  follows  that  |b  /i(A  U Y)  > f(X)  + p(Y). 

In  the  other  direction,  assume  to  the  contrary  that  |b  fi(XUY)  > /l(X)+p(Y). 
Then  it  is  simple  to  see  that  for  some  p and  rationals  r2,  r<±,  rs,  and  r&. 
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P lb  AW  < r3  < r4  A A(F)  < r5  < r6  A /x(X  U F)  > r4  + r6  . 

For  any  q < p,  p.*(X)  < r2  by  (**),  and  so  there  is  an  5 < q such  that  p,s(X)  < r 3. 
By  (*),  it  follows  that  p.p(X)  < r 3.  Similarly,  p,p(Y)  < r5,  and  consequently 

P-*p(X  U F)  < /UP(X  U F)  = fip(X)  + p,p(Y)  < r3  + r5  < r4  + r6  , 

which  by  (**)  contradicts  p |b  p(X)  U (F)  > r4  + r&. 

To  verify  k -additivity,  suppose  that  y < k and  |b  1 X — IJl^a  I a < y}  C.  k 
is  a pairwise  disjoint  union1.  By  finite  additivity,  |b  /1(X)  > In  the 

other  direction,  assume  to  the  contrary  that  for  some  p and  rationals  r\,  r2, 

P lb  T.aP(Xa)  < h <r2<  KX)  . 

For  any  t e [y]<w,  let  |b  X,  = |Ja€f  Xa.  Then  p |b  p(X,)  < f\  by  finite  additivity, 
and  so  an  application  of  (*)  as  in  the  previous  paragraph  shows  that  pp(X,)  < r\. 
Since  t was  arbitrary,  pp(X)  < r\  by  (iii)  above.  Hence,  p*(X)  < r2,  which  by 
(**)  contradicts  p |b  fi(X)  > r2.  This  completes  the  proof.  H 

Kunen-Paris  Forcing 

Kunen  and  Paris  in  their  [71]  provided  the  first  widely  applicable  method  for 
preserving  properties  of  measurable  cardinals  k in  forcing  extensions  by  p.o.’s 
P such  that  |P|  > k,  and  used  it  to  establish  consistency  results  about  saturated 
ideals  I over  k such  that  sat(7)  > k.  Their  basic  scheme  is  presented  in  a form 
that  anticipates  later  developments;  the  rest  of  the  section  is  then  devoted  to  its 
applications. 

Suppose  that  k is  a measurable  cardinal,  U a normal  ultrafilter  over  k,  and 
j:  V -<  M = Ult(V,  U).  Some  general  conditions  are  imposed  on  a p.o.  so  that  j 
can  be  extended,  in  a weak  sense,  in  an  intermediate  generic  extension.  Assume 
that  P is  a p.o.  so  that,  regarding  j(P)  as  a p.o.  in  V, 

(i)  j (P)  = P * Q under  an  identification  through  which 

(ii)  j(p)  = (p,  1Q)  for  every  p e P. 

These  conditions  are  met  by  a variety  of  p.o.’s  that  can  be  analyzed  as  iterations. 

Let  G be  j (P)-generic  over  V.  Then  by  (i)  and  10.9  there  are  corresponding 
Go  and  G\  such  that  G — Go*  G 1,  Go  is  P-generic  over  V,  G\  is  <2G°-generic 
over  V[G0],  and  V[G]  = V[Go][Gi].  Moreover  by  (ii), 

(*)  if  p e Go,  then  j(p)  e G . 

Since  M c V,  G is  also  j ( P (-generic  over  M,  and  we  can  consider  the  generic 
extension  M[G]  of  M.  The  key  observation  is  that  with  (*)  the  embedding  j can 
be  extended  to  an  elementary  embedding 

J:  V[G0]  -<  M[G ] 

definable  in  V[G].  The  idea  is  to  apply  j to  forcing  names: 
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Suppose  that  x e V[G o]  and  Jc  is  any  P- name  such  that  iG°  = x.  Then  j(x) 
is  a j{P)- name,  so  we  can  set 


j(x)  = j (i)M[G]  . 

This  is  well-defined,  i.e.  does  not  depend  on  the  choice  of  name  for  x:  Suppose 
that  v is  another  name  and  p e Go  is  such  that 

P lh/>  x — y . 


Then  j ( p)  e G by  (*),  and  in  M, 

j(P ) b(/>)  j(x)  = j(y)  . 

Hence,  j(x)M [G]  = j (y)M^-G\  The  verification  that  j is  elementary  is  analogous, 
with  the  atomic  formula  x — y replaced  by  a general  formula.  Finally,  j extends 
j,  since  if  x e V,  then  x — xG°,  j (x ) = j (x)  (under  any  reasonable  formalization 
of  the  forcing  language),  and  so  j (x)  = j(x). 

The  embedding  j generates  an  ultrafilter.  For  iV  an  inner  model, 

W is  an  N -normal  ultrafilter  over  k iff 

(i)  W is  an  ultrafilter  on  V(k)  fl  N,  and 

(ii)  For  any  / eKK  fl  N with  {§  < k \ /(f)  < f } e W 
there  is  an  a < k such  that  {f  < k \ /(f)  = a}  e W. 

Note  that  (ii)  subsumes  what  would  be  called  N-k -completeness.  Of  course,  the 
point  is  that  W e N is  not  required.  The  following  recalls  5.6: 

17.6  Exercise.  In  V[G ] with  j as  above,  define  U by: 

X eU  iff  X e V(k)  n V[G0]  Are  J(X)  . 

Then  U is  a V\Go\~normal  ultrafilter  over  k such  that  U 3 U . H 

This  in  sum  is  the  basic  Kunen-Paris  scheme;  its  applications  are  based  on 
those  P such  that  the  Q part  of  j(P)  = P * Q is  tame.  For  example,  if  forcing 
with  Q adds  no  new  subsets  of  k,  then  the  U of  17.6  verifies  the  measurability  of 
k in  V[G].  Also,  the  following  lemma  may  be  applicable  in  V[G\  to  the  dual  of 
U to  yield  saturated  ideals  in  intermediate  models  N,  V[Go]  eye  V[G].  For 
N an  inner  model, 

/ is  a N-X-saturated  ideal  over  k iff 

(i)  / is  an  ideal  on  V(k)  fl  N such  that  if 

y < k and  / e Y J fl  N,  then  Ua<y  /(«)  e J > and 

(ii)  For  any  g e XJ+  fl  N there  are 

a < f < k such  that  g(a)  fl  g(/3)  e J+  . 
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17.7  Lemma.  Suppose  that  k and  X are  regular  and  P is  a p.o.  with  the  X-c.c..  If 
|br>  J is  a V-A-saturated  ideal  over  k , 

then 

I = {X  <Z  k \\\-P  X e j} 
is  a X-saturated  ideal  over  k. 

Proof.  Suppose  that  {Xa  \ a < A}  c /+.  Then 

(*)  |b  \{a  <X  | e J+}\  =X  . 

If  not,  by  the  regularity  of  X and  the  k-c.c.  of  P,  there  would  be  a y < X such 
that 

lb  {a  < X | Xa  e /+}  c Y • 

But  then,  XY  e /,  which  is  a contradiction.  With  (*),  the  y-k-saturation  of  J 
implies  that  there  is  a p & P and  a < < X such  that  p |b  Xa  C\  Xp  e j+, 

i.e.  XanXpeI+.  H 


Kunen  and  Paris  showed  in  applications  of  their  scheme  that  straightforward 
Easton  products  (as  described  before  10.7)  preserve  large  cardinal  properties.  The 
first  application  preserved  measurability,  and  showed  that  while  normal  ultrafilters 
may  be  special,  it  is  consistent  to  have  the  maximum  possible  number  of  them. 

17.8  Theorem  (Kunen-Paris  [71]).  Suppose  that  k is  measurable.  Then  there  is  a 
k-c.c.  p.o.  E such  that 

Ibfi  There  are  22*  normal  ultrafilters  over  k . 

Proof.  Let  P be  the  Easton  product  of  the  p.o.’s  for  adding  a generic  subset  to 
each  successor  cardinal  less  than  k.  P can  be  regarded  as  consisting  of  those 
functions  p with  dom( p)  c k x k and  ran(/?)  C 2 such  that 

(i)  for  any  (£,  £)  e dom(/?),  £ is  a successor  cardinal  less  than  k and 
C < £ , and 

(ii)  for  any  regular  v,  |{(£,  £)  e dom(/2)  | ^ < v}|  < v . 

Here,  the  £ < v comprises  the  case  § < v for  the  Easton  product,  and  the  case 
§ = v (only  for  successor  cardinals  v)  for  adding  a subset  at  v.  Note  that  \p\  < k 
for  every  p e P by  taking  v = k.  P is  ordered  by:  P <p  q iff  P ^ q- 

Some  observations  generally  applicable  to  Easton  products  are  now  made, 
partly  to  establish  a context  for  later  proofs.  First,  there  is  a standard  product 
analysis:  For  any  y,  set 


(P)y  — {p  e P | p c y x k x 2}  , and 
{P)Y  = {p  e P | p n (y  x k x 2)  = 0}  . 
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Then  P = (P)Y  x ( P)Y  under  the  natural  identification  of  p e P with 
{pC\y  xk  x2,  p—yxKX  2).  (P)y  is  y+-closed,  and  if  y<r  — y,  then  (P)r  has 
the  y+-c.c.  by  a typical  application  of  the  A-system  Lemma  10.4.  Hence  by  10.8, 
forcing  with  (P)r  after  (P)y  adds  no  further  subsets  of  y and  preserves  y+  as  a 
cardinal.  Since  there  are  arbitrarily  large  such  y < k,  it  follows  that  k remains 
inaccessible  after  forcing  with  P.  We  next  observe  that  because  k is  Mahlo,  P 
has  the  k- c.c.: 

Suppose  that  A c P is  a maximal  antichain.  By  6.2(b),  let  S < k be 
inaccessible  such  that 

{Vs,  e,PnVs,An  V5>  -<  {VK,  e,  P,  A)  . 

For  any  p e P,  pH  Vs  e Vj  by  (ii),  so  by  the  maximality  of  A n Vs  in  Vs,  pC i Vs 
is  compatible  with  some  element  of  A fi  Vs.  But  then,  it  is  readily  seen  that  p 
itself  is  compatible  with  that  element.  Hence,  A (T  Vj  is  a maximal  antichain  of 
P,  so  that  A = A Ci  Vs  and  |A|  < k. 

Suppose  now  that  U is  a normal  ultrafilter  over  k,  and  j:  V < M = 
Ult(V,  U).  Then 

j(P)  = P x Q , where  Q = ( j(P))K  , 

since  (j(P))K  = P.  Here,  for  any  p e P,  j(p)  is  identified  with  (p,  0}  = 
(p,  llg),  since  \p\  < k and  so  j(p)  — p.  Hence,  the  Kunen-Paris  conditions  are 
satisfied. 

Generally  speaking,  the  U of  17.6  is  seen  by  the  definition  of  j to  correspond 
to  a P x Q-name  U such  that  for  any  P-name  X for  a subset  of  k and  (/;.  q)  e 

PxQ' 

{ P , q)  lh  X e U **  k e j(X)  . 

In  the  present  case,  because  P x Q is  identified  with  j(P)  by  taking  {p,  q)  to 
p U q,  if  q = [fq]u  where  fq:  k ->  P,  it  follows  that 

(*)  {p,  q)  lh  x e U iff  {|  < *:  I p U /,(£)  lh  | e X}  e U . 

Furthermore,  Q is  K+-closed  (since  k is  not  a successor  cardinal).  Because  P 
has  the  k+-c.c.,  it  follows  from  10.8  that  forcing  with  Q after  P adds  no  further 
subsets  of  k.  Hence, 


|| -pxq  U is  a normal  ultrafilter  over  k . 

To  get  22K  normal  ultrafilters,  proceed  as  follows:  In  V,  set  A,  = 22‘  and  let  R be 
the  v+ -product  of  A copies  of  Q,  i.e.  the  p.o.  consisting  of  functions  r exQ  such 
that  | {or  < A | r(a)  / 0} | < k,  ordered  by:  r <R  s iff  Va  < A (r(a)  <q  s{a)). 
Then  force  with  P x R instead.  It  is  readily  seen  that  P x R is  also  v+-closcd, 
so  for  each  a.  < A there  is  a P x P-name  Ua  corresponding  to  U satisfying 


Ih^xR  Ua  is  a normal  ultrafilter  over  k , 
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such  that  for  any  (p,  r)  e P x R and  P- name  X , 

(**)  (p,  r)  lbpx«  X e JJa  iff  {p,  r(a))  \\-PxQ  X e U . 

A genericity  argument  shows  that  for  a < f)  < X,  |bpXR  Ua  f Up'- 

Given  any  (p,  r)  e P x P,  let  (p,  7)  < ( p , r ) be  such  that  7(a)  _L  7(f))  in 
Q.  If  [g]c/  — h(a)  and  [h]u  — 7(f)),  it  follows  that 

Y = {$  < k | g(H)  -1  h(g)  in  P}  e U . 

Let  X be  a P-name  for  a subset  satisfying 


Ibp  e X ^ f e Y a g(|)  e G) 

(where  G is  the  canonical  name  for  the  generic  object),  i.e.  for  any  z e P, 
z H e X ^ e F Az  </>  g(f)-  Then  using  (*)  and  (**)  it  is  straightforward 
to  verify  that 

(p,  7)  |b  X e Ua  - Up  . 

Finally,  X — 7r  in  the  sense  of  the  generic  extension:  Since  there  are  X 
distinct  ultrafilters  over  k,  it  remains  to  verify  that  2 2‘  < X:  Clearly  P x R has 
cardinality  X,  and  a use  of  the  A-system  lemma  10.4  shows  that  it  has  the  (2*r)+- 
c.c..  Hence,  the  computation  of  10.5  yields  the  desired  result.  (Actually,  it  is  seen 
that  22*  can  be  made  arbitrarily  large  in  the  generic  extension.)  H 

The  next  two  theorems  establish  consistency  results  about  ideals  I over  k 
such  that  sat(7)  = k and  sat(7)  = k+ . 

17.9  Theorem  (Kunen-Paris  [71]).  Suppose  that  k is  measurable.  Then  there  is  a 
p.o.  E such  that  lb#  (a)  + (b)  + (c),  where: 

(a)  2*°  = k. 

(b)  There  is  a K-saturated  ideal  over  k. 

(c)  For  no  X < k is  there  a X-saturated  ideal  over  k. 

Proof  Let  P be  the  Easton  product  of  the  p.o.’s  for  adding  k subsets  of  co  and 
of  each  successor  cardinal  less  than  k.  P can  be  regarded  as  consisting  of  those 
functions  p with  dom(/z)  C k x k x k and  ran(/z)  C 2 such  that 

(i)  for  any  (§,  f , r))  e dom(/?),  § is  co  or  a successor  cardinal  less  than 
k,  and  t;  < | , and 

(ii)  for  any  regular  v,  |{(£,  f rj)  e dom(/z)  | $ e v}|  < v , 

intending  p(($,  f >]))  — 1 iff  “the  ?;th  new  subset  of  £ contains  f Note  that 
|p | < k for  every  p e P by  taking  v = k.  P is  ordered  by:  P <p  q iff  P 2 q- 
As  for  17.8,  P has  the  /c-c.c.. 

Suppose  now  that  U is  a normal  ultrafilter  over  k and  j:  V < M = Ult(y,  U). 


Let 
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Then 


P\  = {p  e j(P)  I P c (jOc)  - k)  x j'Oc)  x ;(k)  x 2}  , and 
Pi  = (p  e j'(^)  I P c k x k x 0'(/c)  - /c)  x 2}  . 


;(P)  = P x Pxx  P2 


through  a natural  identification.  Here,  for  any  p e P,  j(p)  is  identified  with 
(p,  0,  0}  = (p,  Up,,  llp2}  since  |p|  < /c  and  so  j(p)  = p.  Hence,  the  Kunen- 
Paris  conditions  are  satisfied. 

We  now  proceed  to  show  that  P x P\  is  the  desired  p.o..  Let  Go  be  P-generic 
over  V and  G\  Pi-generic  over  V\Go\.  To  invoke  the  Kunen-Paris  scheme  let  G2 
be  P2-generic  over  V[Go][Gi],  and  set  G — Go  x G\  x G2.  Since  Pi  is  /r+-closed 
(as  k is  not  a successor  cardinal),  it  follows  from  10.8  that  forcing  with  Pi  after 
P adds  no  further  subsets  of  k.  Hence, 

(*)  the  U of  17.6  is  a V[Go][Gi]-normal  ultrafilter  over  k . 

Observe  next  that  ||-pxPi  Pi  has  the  k-c.c.:  Given  {pa  \ a < k)  C.  P2,  let 


X = {r)  \ 3a3$3?(($ , f,  rj)  e dom  (pa))}  C J(k)-k. 


Then  |X|  < k,  and  so  an  injection:  X ->  k can  be  used  to  associate  to  each  pa  a 
qa  e P so  that:  pa  _L  pp  iff  qa  _L  qp.  Thus,  the  k-c.c.  of  P implies  the  k-c.c.  of 
P2.  By  a standard  antichain  argument,  another  appeal  to  the  k-c.c.  of  P further 
shows  that  ||-p  P2  has  the  k-c.c..  Finally,  Pi  is  /e+-closed,  so  10.8  ensures  that 
II-pxp,  Pi  has  the  k-c.c.. 

Using  this  and  (*),  17.7  can  be  applied  to  the  dual  of  U to  conclude  that  in 
V[Go][Gi]  there  is  a ic-saturated  ideal  over  k.  That  2K°  = k also  holds  follows 
from  standard  arguments  (using  10.5  and  the  k-c.c.  in  one  direction).  To  complete 
the  proof,  it  is  verified  that  in  V[Go][Gi]  there  are  no  L-saturated  ideals  over  k 
for  any  X < k: 

Assume  to  the  contrary  that  there  is  such  an  ideal  for  some  X < k,  which 
can  be  taken  to  satisfy  X<k  = X.  Then  by  a product  analysis  as  in  17.8,  P = 
( P );  x (P)'  where  (P)A  has  the  X+-c.c.  and  (P)x  is  L+-closed.  Go  can  thus 
be  considered  as  H0  x Pl\  where  H0  c (P)A  and  Hi  c (Pf.  If  V[Go][Gi]  is 
regarded  as  a generic  extension  of  LtHiHGi]  using  (P)A,  then  by  17.7  there  is  a 
L+-saturated  ideal  in  V[H\][G\].  Now  2 1 < k holds  in  this  model  as  (P);  x P\ 
is  seen  to  be  A+-closed,  so  it  follows  from  16.4(a)  that  k is  measurable  there. 
However,  2X+  — k also  holds  in  the  model  because  of  (P)x.  This  is  a contradiction. 

H 


17.10  Theorem  (Kunen-Paris  [71]).  Suppose  that  k is  measurable.  Then  there  is 
a p.o.  E such  that  |[p  (a)  + ( b ) + (c),  where: 

(a)  k is  weakly  compact. 

(b)  There  is  a k+ -saturated  ideal  over  k. 

(c)  There  is  no  k -saturated  ideal  over  k. 


232 


Chapter  4.  Aspects  of  Measurability 


Proof.  Let  P be  the  Easton  product  of  the  p.o.’s  for  adding  a generic  subset  to 
each  regular  cardinal  less  than  k,  i.e.  P is  as  in  the  proof  of  17.8  except  that 
“successor”  is  replaced  by  “regular”  in  its  (i).  Again,  P has  the  k-c.c.. 

Suppose  that  U is  a normal  ultrafilter  over  at  and  j:  V -<  M = Ult(V,  U). 
Let 

P\  =lpe  j(P)  I P c (j(K)  - k + 1)  x j(K)  x 2}  and 
Pi  = {p  e j(P)  | p c {/c}  x k x 2}  . 

Then  Pi  is  /c+-closed,  P2  is  /c-closed  and  has  the  /c+-c.c.,  and  since 

j(P)  = P x Pi  x P2 

through  a natural  identification  which  for  any  p e P identifies  j(p)  with 
(p,  11  P[,  11  Pl),  the  Kunen-Paris  conditions  are  again  satisfied. 

We  now  proceed  to  show  that  P x Pi  is  the  desired  p.o..  Let  Go  be  P-generic 
over  V and  Gi  Pi-generic  over  V[Go].  To  invoke  the  Kunen-Paris  scheme,  let 
Gi  be  P2-generic  over  V[Go][Gi],  and  set  G = Go  x Gi  x G2.  As  in  the  previous 
proof,  the  /c+-closure  of  Pi  implies  that  in  V[G],  there  is  a V[Go][Gi]-normal 
ultrafilter  over  k.  Since  P2  has  the  /c+-c.c.,  it  follows  from  17.7  that  there  is  a 
k+ -saturated  ideal  over  k in  V[Go][Gi]. 

We  next  verify  that  k is  weakly  compact  in  V[Go][Gi]:  By  7.8,  k is  weakly 
compact  iff  k is  inaccessible  and  has  the  tree  property,  and  by  10.8  it  suffices  to 
establish  these  properties  in  V[Gq],  First,  k is  inaccessible  in  V[Go]  by  remarks 
at  the  beginning  of  the  proof  of  17.8.  Next,  Suppose  that  (T,  <t)  is  a /c-tree 
in  V[Go]  with  T c at.  Applying  the  Kunen-Paris  j : V[Go]  -<  M[G]  extending 
j,  ( j(T ),  j(<r))  is  a j(ic)- tree  in  M[G].  Hence,  any  y at  its  Kth  level  determines 
a k -branch  through  T since  j(T)  H k — T . It  remains  to  show  that  such  a branch 
already  exists  in  V[Go].  This  is  not  immediate  since  V(k)C\ V[Go]  = P(/c)nV[G] 
fails;  we  work  in  V[Go]  and  rely  on  the  fact  that  V[G]  is  a generic  extension  by 
Pi  x P2,  a /c-closed  notion  of  forcing: 

By  the  above  remarks,  there  is  a Pi  x P2-name  r such  that 

l|-Pix/>2  t is  a /c -branch  through  T . 

Using  /c -closure,  define  conditions  pa  e P\  x !\  and  sets  ba  for  a < k by 
recursion  such  that  for  a < < k:  (i ) pa  ||-  r fl  a — ba,  and  (ii)  pp  < pa.  But 
then,  b — Uff<K:  is  a ^-branch.  (Note  that  we  did  not  need  any  p such  that 

P lb  r = b.) 

To  complete  the  proof,  it  remains  only  to  verify  that  there  are  no  a: -saturated 
ideals  over  k in  L[Go][Gi].  To  do  this,  it  suffices  by  16.4(b)  to  argue  in 
L[G0][Gi]  that  k cannot  be  measurable  there:  For  any  regular  a < at,  P had 
added  a generic  set  g(a)  C a such  that  g(a)  fl  § e V for  any  § < a.  If  at 
were  measurable  and  W a normal  ultrafilter  over  k,  let  x = [g]w-  As  usual, 
x = jw(x)  fl  k,  i.e.  {a  \ g(a)  — x Da}  e W.  However,  for  any  a < /J  both  in 
this  set  g(a)  — gift)  fl  a e V,  which  is  a contradiction.  H 
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17.11  Exercise.  17.10  holds  with  “k  is  weakly  compact’’  replaced  by  “k  is  weakly 
inaccessible  and  2K°  = k ”. 

Hint.  One  approach  is  to  force  first  with  the  p.o.  E of  17.10,  and  then  do  the 
&>i-c.c.  forcing  for  adding  k Cohen  reals.  17.1  verifies  (b),  and  17.7,  (c).  H 

Some  further  consistency  results  are  cited  as  the  possibilities  for  non- 
measurable  k and  sat(7)  < k+  for  some  ideal  I over  k:  If  k is  weakly  compact, 
the  only  possibility  by  16.4(b)  is  sat(7)  = k+,  shown  consistent  by  17.10.  If  k is 
strongly  inaccessible  but  not  weakly  compact,  the  only  possibilities  are  sat(7)  = tc, 
shown  consistent  by  Kunen  [78],  and  sat(7)  = k+,  shown  consistent  by  Boos  [74]. 
If  k is  weakly  but  not  strongly  inaccessible,  then  a>  < sat(7)  < k is  consistent  by 
17.1,  sat(7)  = k by  17.9,  and  sat(7)  = k+  by  17.11.  Finally,  if  k is  a successor 
cardinal,  the  only  possibility  by  16.3  is  sat(7)  = k+ , shown  consistent  by  Kunen 
[78]  in  1972. 

This  last  result  generated  considerable  interest.  The  others  had  all  assumed 
the  consistency  strength  of  measurability,  the  exact  strength  required.  However, 
Kunen  [70]  had  previously  shown  through  inner  models  that  the  consistency 
strength  of  having  a k+ -saturated  ideal  over  a successor  cardinal  k was  strictly 
stronger  than  that  of  measurability.  He  then  proceeded  to  establish  that,  assuming 
a strong  hypothesis,  the  existence  of  a huge  cardinal  (§24),  there  is  a generic  ex- 
tension in  which  there  is  an  &>2 -saturated  ideal  over  to\.  Furthermore,  he  suggested 
prima  facie  evidence  for  the  necessity  of  some  such  hypothesis.  This  is  an  early 
instance  of  an  emerging  approach  to  the  analysis  of  combinatorial  propositions 
about  the  low  orders  of  the  cumulative  hierarchy:  By  using  the  methods  of  forc- 
ing and  inner  models,  their  consistency  strength  is  located  between  two  hypotheses 
in  the  hierarchy  of  large  cardinals.  Short  of  achieving  an  equiconsistency  result, 
their  strength  can  then  be  compared  to  others,  and  suggested  possibilities  pursued 
to  refine  the  large  cardinal  correlations.  As  for  getting  a>2 -saturated  ideals  over  w\, 
far  less  than  huge  cardinals  are  now  known  to  suffice  (see  §32,  especially  32.10). 
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18.  Prikry  Forcing 

This  section  describes  a simple  but  elegant  notion  of  forcing  for  measurable  car- 
dinals devised  by  Karel  Prikry,  as  well  as  a useful  generalization.  These  together 
with  his  forcing  results  discussed  in  §17  appeared  in  his  1968  Berkeley  dissertation 
with  Silver.  Prikry  and  also  Thomas  Jech  were  originally  members  of  Vopenka’s 
productive  seminar,  and  their  emigration  was  a Prague  spring  of  sorts  for  set  the- 
ory in  the  United  States.  Although  Prikry’s  forcing  may  have  seemed  a bit  of 
a curiosity  at  first,  it  soon  became  an  integral  part  of  the  theory  of  measurable 
cardinals.  Not  only  did  it  add  to  the  growing  store  of  intuitions  about  them,  but 
it  provided  a basic  paradigm  for  later  generalizations. 

Maintaining  some  generality,  let  F be  a filter  over  a cardinal  k.  Then  Prikry’s 
forcing  p.o.  for  F is 

PF  = [/c]<®  x F 

ordered  by 

{ s , A)  < ( t , B ) iff  t is  an  initial  segment  of  5 and  AU(j-f)CS. 

The  idea  is  that  a condition  (s,  A)  determines  a finite  initial  segment  s of  a new 
subset  of  k whose  further  members  are  to  be  restricted  to  a set  A,  kept  large  in  the 
sense  of  being  in  F.  Note  that  PF  is  not  separative,  since  for  any  (s,  A),  ( t , B)  e 
Pf , { s , A)  < ( t , B)  exactly  when  ($,  A)  < (t,  B — (max(f)  + 1)>.  It  could  be 
required  of  conditions  (s.  A)  that  A fl  (maxis)  + 1)  = 0 with  only  notational 
complications. 

18.1  Lemma. 

(a)  If  (s,  A)  ||  ( t , B),  then  s is  an  initial  segment  of  t or  vice  versa. 

(b)  PF  has  the  k+-c.c.. 

(c)  If  G is  Pp-generic  and 

x = U(*  I 3A«s,  A)  e G)}  , 

then  x is  a unbounded  subset  of  k of  ordertype  co,  and  V[x]  = V[G], 

Proof,  (a)  is  clear,  and  (b)  follows  from  the  observation  that  any  two  conditions 
with  the  same  first  component  are  compatible. 

For  (c),  that  x is  unbounded  in  k with  ordertype  > to  follows  from  a density 
argument  (by  a §0  convention,  {£  | a < f < k)  e F for  every  a < k).  On  the 
other  hand,  for  any  (s,  A)  e G,  s must  be  an  initial  segment  of  x by  (a),  and  so 
x must  have  ordertype  < to.  Finally,  G can  be  recovered  from  x since  if 

Gx  = {{s,  A)  e PF  | s is  an  initial  segment  of  x and  x — s c A}  , 

then  G — Gx  (cf.  before  11.14).  H 

Because  of  this  last  point,  for  M a transitive  e-model  of  ZFC  such  that 
M |=  is  a filter  over  a cardinal  /r1  and  x e |)c]®, 
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x is  Pp-generic  over  M iff  Gx  is  /V -generic  over  M , 
and  identifying  x with  its  increasing  enumeration,  that 
x is  a Prikry  sequence  over  M iff  x is  Pf -generic  over  M for  some  such  F . 

Since  the  cofinality  of  r is  © in  any  forcing  extension  by  Pf,  k may  be 
collapsed.  The  following  key  lemma  is  how  large  cardinal  ideas  enter  the  picture 
in  the  case  by  which  Prikry  forcing  is  primarily  known. 


18.2  Lemma  (Prikry  [70]).  Suppose  that  k is  measurable  and  U is  a normal 
ultrafilter  over  k.  Then  for  any  (s.  A)  e Pu  and formula  ip  in  the  forcing  language, 
there  is  a B C A with  B e U such  that  (s,  B)  ||  ip. 

Proof.  Define  a function  /:  [A  — (max(s)  + 1)]<<U  ->  2 by 


fit)  = 


0 if  ((s  U t,  X ) ||-  (p)  , and 

1 otherwise . 


By  Rowbottom’s  7.17,  there  is  a B c A — (max(.y)  + 1)  with  B e U homogeneous 
for  /.  It  follows  that  (s,  B)  ||  < p:  Otherwise,  there  would  be  a {s Ufo,  Bo)  < (s,  B) 
and  a {s  U t\,  B\ ) < (s,  B)  such  that  (s  U to,  Bo)  ||-  ip  and  (s  U t\,  B\)  ||-  -'ip.  By 
extending  one  condition  if  necessary,  we  can  assume  that  |?o I = |f|  I ■ But  then 
f(t0)  — 0 and  f(t] ) — 1,  contradicting  the  homogeneity  of  B.  H 


For  any  (s,  A)  e Pu  and  formula  ip,  there  is  always  a ( t , B)  < [s,  A)  that 
decides  ip;  the  point  of  the  lemma  is  that  we  can  take  t = s.  This  leads  to  a strong 
conclusion  usually  drawn  from  the  a: -closure  of  a p.o.: 


18.3  Lemma.  With  the  hypotheses  of  18.2,  for  any  Pu-name  f and  set  y with 
|y|  < k,  there  is  a z c y such  that  ||-  f C y ->  f = z. 

Proof.  By  a standard  forcing  argument  (as  for  the  maximal  principle),  it  suffices 
to  assume  that  (s,  A)  ||-  f c y and  find  a ( t , B)  < (s,  A)  and  a zeV  such  that 
( t , B)  ||-  f = z.  For  each  a e y,  by  18.2  let  (s,  Ba)  ||  a e f.  Since  |y|  < k, 
B = n«€y  B«  e U.  If  z is  then  defined  by  a e z iff  (s,  B)  ||-  a e r,  it  follows 
that  (s,  B)  \[  r = z,.  H 


The  following  summarizes  the  situation: 


18.4  Theorem  (Prikry  [70]).  Suppose  that  k is  measurable  and  U is  a normal 
ultrafilter  over  k.  If  G is  Pu-generic,  then  ( VK ) v = (VK)v ^ and  the  cardinals  of 
V and  V[G ] coincide,  yet  cfv[G](v)  = CO. 

Proof.  That  (VK)v  = (VK)v[c]  can  be  established  by  a straightforward  induction 
on  rank  using  18.3,  so  that  the  cardinals  below  k are  preserved,  k too  remains  a 
cardinal  since  it  is  a limit  cardinal,  and  the  cardinals  above  k are  preserved  by  the 
AT  + -C.C..  H 
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So,  without  disturbing  the  universe  below  at,  Pv  introduces  an  cosequence 
overlay  on  k and  the  crux  of  the  matter  is  Prikry’s  lemma  18.2.  18.4  first  solved 
the  problem  of  getting  an  extension  of  a model  of  ZFC  with  the  same  ordinals  and 
cardinals  but  a different  cofinality.  Remarkably,  later  developments  in  inner  model 
theory  would  demonstrate  that  the  consistency  strength  of  a measurable  cardinal 
is  necessary  for  this  purpose  (see  volume  II).  In  any  case,  just  for  the  preservation 
of  cardinals  alone,  a Rowbottom  filter  (as  defined  before  16.10)  suffices: 

18.5  Exercise  (Prikry  [70]).  Suppose  that  F is  a Rowbottom  filter  over  k.  Then 
forcing  with  Pf  preserves  all  cardinals. 

Hint.  It  suffices  to  show  that  no  successor  cardinal  k+  < k in  V is  collapsed 

by  Pf-  So,  set  p = A+  and  suppose  that  (s.  A)  ||-  f : X — > p.  For  each  t e 

[A  - (max (,s)  + \)]<w  let 

E,  = tn  |3$3X«*U  t,x)  lb  i($)  = I?)}  . 

\Et\  — 7-  as  f is  forced  to  be  a function,  so  set  /(f)  = sup(£',)  < p.  Let 
B c A — (max(s)  + 1)  with  B e F be  such  that  \f“[B]<w\  < co,  so  that  S = 
sup(/“[Z?]<<a)  < p.  Finally,  show  that  (s,  B)  |(  r“ A.  C.  8.  H 

Devlin  [74]  characterized  those  F over  k such  that  PF  preserves  cardinals 

by  a property  slightly  weaker  than  being  Rowbottom  which  is  implicit  in  the 
preceding  proof.  On  the  other  hand,  he  and  Paris  (Devlin  [74:  30])  established  the 
full  converse  for  PF  rendering  (VK)v  — (VK)v^  for  any  PF -generic  G:  F must 
be  a /c-complete  ultrafilter  satisfying  the  Rowbottom  partition  property  7.17  - all 
that  18.2  required  of  normality. 

It  is  not  surprising  that  PF  should  preserve  large  cardinal  properties;  the 
following  result  provided  an  example  of  a Rowbottom  cardinal  of  cofinality  co. 

18.6  Exercise  (Prikry  [70]).  Suppose  that  F is  a Rowbottom  filter  over  k.  Then 
in  any  generic  extension  by  Pp,  F generates  a Rowbottom  filter  over  k. 

Hint.  If  A < k and  (s,  A)  |[  f : [Ar]<al  ->  X,  it  suffices  to  find  a B c A with  B e F 
such  that  (s,  B)  |[  |f“[5]<w|  < co.  To  do  this,  for  each  t e [A  — (maxfs)  + 1)]<W 
let 

/(f)  = (?)  | 3u,v3X(t  — u U v A {s  U u.  X)  |[  r(v)  — ?;)}  . 

/(f)  e [L]<<a  since  f is  forced  to  be  a function.  Let  B c A — (max(^)  + 1)  with 
B e F such  that  |/“[B]<‘UI  < co,  and  set  W = U /“[B]<a).  Finally,  show  that 
(s,  B)  |b  r“[B]<w  C W.  H 

A revealing  characterization  of  Pa  -generic ity  for  normal  ultrafilters  U is 
established  next.  If  F is  any  ultrafilter  over  k (in  V)  and  x e [at]®  is  P^-generic 
over  V,  then  since  x manages  to  burrow  its  way  into  every  X e F by  genericity, 
it  is  simple  to  see  that  x generates  F in  the  following  sense: 
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WX  evn  V(k)(X  e F **x-X  is  finite)  . 

Mathias  provided  a basic  structural  rationale  for  Prikry  forcing  by  establishing  the 
converse  for  normal  ultrafilters.  Generalizing  a familiar  concept,  for  (As  \ s € 
[k]<<8)  with  each  As  C k,  its  diagonal  intersection  is 

ASA.S  = {£  < k | £ e flM*  I max(s)  < £}}  . 

If  U is  a normal  ultrafilter  over  k and  each  As  e U,  then  it  is  simple  to  see  that 

A .5  As  e U. 

18.7  Theorem  (Mathias  [73]).  Suppose  that  M is  a transitive  €-model  of  ZFC  in 
which  k is  a measurable  cardinal  and  U a normal  ultrafilter  over  k.  Then  for  any 
x e [/c]®. 


x is  /’[/-generic  over  M iff  WX  e U(x  — X is  finite)  . 

18.8  Corollary.  With  M as  above,  if  x is  Pu-generic  over  M and  y e [x]®,  then 
y is  also  Pu-generic  over  M.  H 

Proof  of  18.7.  In  the  substantive  direction,  suppose  that  x e [/c]®  and  x — X is 
finite  for  every  X e U . We  must  show  that 

Gx  = {(s.  A)  e Pu  | s is  an  initial  segment  of  x and  x — s c A] 

is  /’[/-generic  over  M,  which  reduces  to  showing  that  if  /)  e M is  dense  open  in 
Pu,  then  Gx  D D 0: 

For  each  s € [/r]<®,  define  fs:  [k  — (max(i)  + 1)]<®  ->  2 by 
[0  if  3Z((iUf,Z)  e D)  , and 

fsit)  = 

I 1 otherwise  . 

By  7.17,  there  is  an  As  c k — (max (,y)  + 1)  with  As  e U homogeneous  for  fs. 
Set  A = A ,Aj  e U. 

Next,  for  each  s e [/c]<®,  if  there  is  an  X such  that  (s,  X)  e D,  let  Bs  be 
such  an  X;  otherwise,  let  Bs  = k.  Set  B — A D ASBS  e U . Then 

(*)  V.?  e [a:]<®(3X({5,  X)  e D {s,  B - (max(s)  + 1)>  e D))  , 

noting  that  D is  dense  open. 

By  hypothesis,  there  is  an  initial  segment  so  of  x such  that  x-JoCB.  Now 
by  density  there  is  a t e [B  — (max(so)  + 1 )]<®  such  that  (s0  U t,  X)  e D for  some 
X.  Consequently,  if  to  consists  of  the  first  \t\  members  of  x — so  c B , then  the 
homogeneity  of  B — (max (so)  + 1)  c A.So  for  /Jo  implies  that  (so  U to,  X)  e D for 
some  X.  But  then  by  (*),  (so  Uto,  B — (max(fo)  + 1))  e D.  This  condition  is  also 
a member  of  Gx,  and  so  the  proof  is  complete.  H 
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18.8  is  analogous  to  11.14(b)  for  Mathias  reals,  and  indeed  the  two  were 
proved  together  by  Mathias.  The  analogies  extend  to  the  circumstance  that  Mathias 
forcing  is  equivalent  to  first  adjoining  an  ultrafilter  U over  u>  with  strong  properties 
by  an  &>i-closed  forcing,  and  then  forcing  with  Pv  (Mathias  [77:  §4]).  An  external 
ft)  sequence  generating  an  internal  normal  ultrafilter  also  occurs  in  the  theory  of 
iterated  ultrapowers  as  described  in  the  next  section.  To  conclude  it,  we  establish 
a connection  with  Prikry  forcing  extensively  studied  by  Patrick  Dehornoy.  Claude 
Sureson  [89]  has  studied  the  interplay  of  various  Prikry  extensions  in  terms  of 
degrees  of  constructibility. 

As  inner  model  theory  developed  through  the  1970’s,  intriguing  results  were 
to  provide  intimate  connections  among  the  forcing  and  inner  model  phenomena 
and  the  concept  of  generating  indiscemibles.  With  this  synthesis,  Prikry  forcing 
has  become  a integral  part  of  the  study  of  measurability. 

A Generalization 

Prikry  [70: 17, 18,38]  also  considered  a generalization  of  his  forcing  for  which  a 
version  of  his  crucial  lemma  18.2  can  be  established  without  assuming  the  partition 
property  7.17.  The  generalization  being  a natural  and  useful  one  for  a variety  of 
purposes,  it  has  since  been  rediscovered  by  Dehornoy  [78],  who  established  the 
analogue  of  18.7  for  it,  and  variants  by  Moti  Gitik  [80,86],  Andreas  Blass  [88], 
and  James  Henle  [90]  (see  also  the  work  of  Arthur  Apter  and  Henle  in  their 
[91,92]).  A general  formulation  is  provided,  the  main  properties  established,  and 
then  several  examples  given. 

Let  (/,</)  be  a non-empty  directed  set  (§0)  with  no  maximal  elements,  and 
with  </  understood  let  <WI+  denote  that  subset  of  <a>I  consisting  of  increasing 
functions,  i.e. 

<(°I+  — [s  | 3 n e cd(s:  n — » I A V/V j (i  < j < n — >■  s(i ) </  s(j)))}  . 

Next,  let  J7  be  a function  on  <ft7+  such  that  for  s e <“/+, 

(i)  T(s)  is  an  ultrafilter  over  I , and 

(ii)  for  any  i e /,  {j  \ i </  j}  e f(s)  . 

For  such  T, 

T\F  = M | A is  a function  A dom(A)  = <w/+  A V.s  e <wI+(A(s)  e ^AG))}  . 
For  A,B  e\\r  set 


dc'B  iff  Vse  <u>I+{Ms)  c B(s))  . 
Finally,  define  a notion  of  forcing  with 

p. r = <mi+  x 


ordered  by 
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{s,  A)  < {t,  B)  iff  (i)  t <Zs  , 

(ii)  ^c*g,  and 

(iii)  for  \t\  < i < |sj,  s(i)  e B(s\i ) . 

(t  c s is  equivalent  of  course  to  .v 1 1 r | = t,  i.e.  t is  an  initial  segment  of  s.) 

When  I = k,  <j  is  the  membership  relation  on  k,  and  T is  the  constant 
function  with  range  {U}  where  U is  a normal  ultrafilter  over  k,  then  Pjr  and  Pu  as 
defined  earlier  are  equivalent  forcing  notions:  Temporarily  ignoring  the  difference 
between  members  of  <<u[/c]+  and  their  ranges  in  [/c]<<a,  define  o:  Pjj  ->  Pjc 
by  setting  f({s,A))  — (s,A)  where  A is  the  constant  function:  [/r]<<a  ->  {A}, 
and  ip.  Pjr  -»  Pv  by  setting  i\({t,B))  — (t,  B)  where  B — A sB(s),  so  that 
B — (max(^)  + 1)  c B(s)  for  each  s.  Noting  that  for  any  (s.  A),  ( t , B)  e Pj?, 
(s.  A)  < i0(h((t,B)))  implies  that  {s,A}  < ( t,B ),  it  is  straightforward  to  check 
that  i(,  and  i\  are  dense  embeddings  (cf.  10.1). 

The  idea  of  the  generalization  is  that  for  a finite  s,  instead  of  considering 
extensions  drawn  from  a single  large  set,  we  consider  those  that  are  chains  in  a 
tree,  the  possible  one-point  extensions  at  each  node  being  large  in  a sense  specific 
to  that  node. 

The  following  is  the  analogue  of  18.1: 

18.9  Exercise. 

(a)  If  (s,  A)  ||  (t,  B),  then  s c.  t or  vice  versa. 

(b)  P > has  the  \I\+-c.c.. 

(c)  If  G is  P^-generic  and 

* = Ufa  I 3>t(fa,-4)  e G)}  , 

then  x:  w — > I and  is  unbounded  in  (/,  </},  i.e.  for  any  i e I there  is  an  n e a> 
such  that  i </  x(n),  and  V[x ] = V[G].  H 

The  last  assertion  of  (c)  is  established  by  showing  that  if 

Gx  = {fa,  A)  e Pp  | s c x A V/(|s|  < i < a>  — >■  s(i ) e (^ | / ) ) } , 

then  G — Gx.  Because  of  this,  for  M a transitive  e-model  of  ZFC  in  which  I 
and  T are  as  above  and  x:  a>  -»  /, 

x is  (Pjr)M -generic  over  M iff  Gx  fl  M is  -generic  over  M . 

The  following  is  the  analogue  to  18.2,  and  it  is  crucial  for  this  context  just 
as  18.2  was  for  its. 

18.10  Lemma.  Suppose  that  {s,A)  e Pjr  and  <p  is  a formula  of  the  forcing 
language.  Then  there  is  a B e w'tb  B c.*  A such  that  (s,  B)  ||  (p. 
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Proof.  The  following  is  first  established: 

(*)  For  any  t e <a>I+,  there  is  a X e T such  that  (r,  X)  ||-  <p 

iff  {i  el  | 3CF«f~</>,  y)ePr  a (n(i),  y)  |[  <p)\  e T(t)  . 


The  forward  direction  is  clear,  since  for  any  i e X)t)  with  t{\t\  — 1)  </ 
i,  (t^(i),  X)  < (t,  X).  For  the  converse,  let  W be  the  hypothesized  member  of 
Tit),  and  for  each  i e W with  f(|f|  — 1)  </  i let  y,  be  such  that  {t^{i),  yf  ||-  <p. 
Define  X e\\T  by. 


( yfu)  if  i e W and  t^~(i)  c u , 


X{u)  = 


W 

I 


if  u = t , and 
otherwise  . 


Then  it  is  simple  to  check  that  (r,  X)  ||-  cp . 

To  establish  the  lemma  assume  to  the  contrary  that  for  no  B c*  A does 
{ s , B)  ||  <p.  For  t e <0>I+  with  *C(,  set 


Aft)  = [i  e Ait ) | -3 y{(t~(i),y)  ePr  a ( t~(i),y ) ||  <p)}  . 


For  such  t satisfying  V/(|s|  < i < |t|  — > t(i)  e A!{t\i)),  it  is  straightforward  to 
check  that  Aft)  e Tit)  by  induction  on  |t|,  repeatedly  applying  (*)  for  and 
-'(p  “relativized”  to  A.  Flence,  if  we  further  set  Aft)  — Ait)  for  those  t such 
that  sgf,  then  A!  e\\T  with  A!  c*  A.  But  then,  no  ( t , X)  < { s , A!)  can  ever 
decide  <p,  which  is  a contradiction.  H 


The  following  is  the  analogue  of  18.3: 

18.11  Exercise.  Suppose  that  X is  regular  and  Tis)  is  X-complete  for  every 

s e <coI+.  Then  for  any  Pjr-name  i and  set  y with  |y|  < X,  there  is  a z C y such 
that  ||-  f C y — > f = z.  ~t 

All  this  was  known  to  Prikry  [70];  Dehomoy  [78]  went  on  to  establish  the 
following  analogue  of  the  characterization  18.7  in  his  context.  For  the  general 
result,  the  partition  property  argument  for  18.7  is  replaced  by  a well-foundedness 
argument. 

18.12  Theorem.  Suppose  that  M is  a transitive  e-model  of  7. FC  in  which  I and 
T are  as  above.  Then  for  any  <i-increasing  x:  <«->/, 

x is  )Pjr)M- generic  over  M iff 

V.4  e (]”[ T)M3n  e oAHin  < i < co  — > x)i)  e A(x\i))  . 

Proof.  The  forward  direction  is  clear  since  for  any  A e i[\T)M , {( t,B ) e 
( Pp)M  \ B c.*  A)  is  dense. 

For  the  converse  direction,  we  must  show  that 

Gx  — {(s,  A)  e {Pf)m  | s c x a V* (| j | < i < co  -»  s(i)  e ^4 (^ | / ) ) } 
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is  (Pf)M -generic  over  M,  which  soon  reduces  to  showing  that  it  meets  every 
dense  open  set  in  M. 

To  this  end,  argue  in  M:  Assume  that  I)  is  dense  open.  For  (s,  A)  e Pf  set 

t(s,  A)  = [t  \ s c.t  a Vi(|i|  < i < \t\  — > t{i)  e A(t\i)) 

A 3Be  i D)}  , 

i.e.  the  set  of  possible  first  coordinates  t of  extensions  of  (s,  A)  “unsecured”  in 
the  sense  that  there  is  still  a B such  that  (t,  B ) D.  As  D is  open,  if  to  c t\  and 
t\  e x Os,  .4),  then  to  e rfs,  A).  Define 

(s,  A)  snares  D iff  there  is  no  y:  to  —>  I such  that 
for  every  n e to , y\n  e r(s,  A) 

(i.e.  (r(j,  ^4),  D>  is  well-founded).  Then  for  any  5 e <a7+,  there  is  an  A € Y\F 
such  that  (s,  A)  snares  D: 

Arguing  as  for  18.10,  the  following  can  be  established: 

For  any  t e <M /+  there  is  a X e []  T such  that  (f,  X)  snares  I) 
iff  {i  el  | 3 y((t~(i),  y)  e P?  a y)  snares  D)}  e F(t)  . 

Using  this,  assume  to  the  contrary  that  there  is  no  A e Y\T  such  that  (s.  A) 
snares  D and  get  an  A'  e ]”[  T such  that  for  any  t e <WI+  with  s c t , 

Aft)  = {i  el  | -3  y((t~(i),y)  e Pjr  a (t~(i),y)  snares  D)}  . 

Flowever,  any  (t,X)  < (s,  A')  with  (t,  X)  e D certainly  snares  D,  which  is  a 
contradiction. 

To  complete  the  proof  of  the  theorem,  for  each  .s’  e <0:  /+  let  A e P|  F be 
such  that  (s,  _4S}  snares  D.  Define  a “diagonal  intersection”  of  the  _4j’s  by  setting 

B{t)  = f|,crA(0  for  t e <a7+  . 

Be  and  it  is  simple  to  check  that  for  any  s e <w/+,  r(s,  B)  c r(5,  ^ls),  so 
that  {s,  B)  snares  D. 

By  hypothesis,  there  is  an  new  such  that  for  n < i < to,  x(i)  e B(x\i). 
Now  (x\n,B)  snares  D,  i.e.  (x(x\n,  B),  D)  is  well-founded,  so  there  is  (0.3)  an 
order-preserving  function  p:  r(x\n,  B)  — > On.  Finally  stepping  out  of  M,  such  a 
p also  verifies  that  x(x\n,  B)M  is  well-founded  in  V,  and  consequently  there  must 
be  an  m > n such  that  (x\m,B)  e D.  But  (x\m,B)  e Gx,  and  so  the  proof  is 
complete.  H 

18.13  Corollary.  With  M as  in  18.12,  if  x is  (Pp)M -generic  over  M and  y:  to  — »■ 
1 is  < i -increasing  with  ran(y)  C ran(x),  then  y is  also  (P f)M -generic  over  M.-\ 

Having  duly  established  that  the  main  features  of  Prikry  forcing  carry  over 
to  the  general  setting,  this  section  is  concluded  with  several  examples: 
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1.  (Prikry  [70:38])  I is  k+ , <j  the  membership  relation  on  k+  and  T a 
constant  function  with  range  {[/}  where  U is  a a: -complete  ultrafilter  over  k+  . Then 
forcing  with  /y  adds  no  new  bounded  subsets  of  k and  preserves  all  cardinals 
except  k+,  whose  cofinality  is  changed  to  co. 

2.  (Prikry  [70:38])  I is  at+,  </  the  membership  relation  on  k+ , and  there 
is  an  increasing  sequence  (k„  \ n e co)  of  cardinals  cofinal  in  k such  that  Un 
is  a K„-complete  ultrafilter  over  at+  for  n e co  and  F(s)  = U ^ for  s e <t0I+. 
Then  again  forcing  with  P j?  adds  no  new  bounded  subsets  of  k and  preserves  all 
cardinals  except  k+  , whose  cofinality  is  changed  to  co.  (But  here,  we  must  first 
note  that  the  B of  18.10  can  also  be  required  to  satisfy  B(t)  — A(t)  for  s g f 
by  suitably  adjusting  the  proof,  and  using  this,  that  for  18.11  T{. s)  need  only  be 
k-complete  for  |s|  sufficiently  large.) 

3.  (Dehomoy  [78])  I is  k > co,  </  the  membership  relation  on  k,  and  T a 
constant  function  with  range  {(7}  where  U is  a (not  necessarily  normal)  a: -complete 
ultrafilter  over  k.  In  this  case,  forcing  with  Pjr  has  an  equivalent  formulation  which 
is  more  like  Prikry  forcing: 

Let  f e kk  be  such  that  [f]v  = k,  i.e.  for  any  a < k,  {§  < k \ a < /(f)]  e 
U,  yet  for  any  g e kk  such  that  {§  < k \ g(^)  < /(f)]  e U there  is  an  a < k 
satisfying  {£  < k \ g(%)  = a]  e U.  For  (As  \ s e [at]<<u)  with  each  As  c at,  its 
f-diagonal  intersection  is 

A{ As  = {$<tc  ||e  I max(.s)  < /(§)}}  . 

As  for  the  case  when  / is  the  identity  map:  k — > at,  i.e.  U is  normal,  it  is  simple 
to  check  that  if  each  As  e U,  then  a{  As  e U. 

Setting 


<m[K]f  = {se  <>]  | Vi  < |S|  - l(s(0  < f(s(i  + 1)))}  , 


let 

pf  = <°>[K]f  x U 

ordered  by 

(s,  A)  < (t,  B)  iff  t is  an  initial  segment  of  s and  A fl  (ran(s)  — ran(r))  c B . 

It  can  be  assumed  that  /(§)<§  for  each  f < at,  and  so  this  is  a stringent  version 
of  Prikry  forcing  where  beyond  s(i)  < s(i  + 1)  we  require  40  < / (s(i  + 1))  < 
s(i  + 1). 

Echoing  remarks  before  18.9  and  again  temporarily  ignoring  the  difference 

f 

between  members  of  <w[k\+  and  their  ranges  in  [at]<w,  define  0:  P(j  -»  Pjr  by 
setting  i0((s,  A»  = (s.  A)  where  A is  the  constant  function  [at]<<w  — > [A],  and 
ii:  Pjp  — > p{j  by  setting  i\{{t,B))  — ( t , B)  where  B = A { B(s).  Note  that  for 
any  { t , B)  e P f,  if  B'\  [a  ]<<0  — >■  U is  defined  by 


B'(s)  = B(s ) n {§  < k | max  CO  < /(f)]  e U , 
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then  for  any  (s,A)  < (t,Br)  and  i + 1 e domC?)— dom(t),  s(i)  < f(s(i  + 1)). 
Consequently,  {s,A)  < ?oO’i((C  B')))  implies  that  (s.  A)  < { t,B ').  It  is  now 
straightforward  to  check  that  iq  and  i\  are  dense  embeddings. 

Gitik-Magidor  [92]  used  this  forcing  in  the  Py  version. 

4.  With  k < y,  I is  VKy  = {x  c y | |x|  < k},  </  the  proper  inclusion 
relation  on  VKy,  and  T a constant  function  with  range  {U}  where  U is  a k- 
complete  ultrafilter  over  VKy  such  that  {x  eVKy  \ a e x}  e U for  every  a < y. 
Then  forcing  with  Pjr  adds  no  new  bounded  subsets  of  k,  and  for  every  regular 
<5  with  k < S < y renders  its  cofinality  a>:  if  x:  a>  —>■  VKy  is  /V-generic,  then 
U neo>x(n)  Cl  8 = 8.  Since  the  cardinal  successor  of  k in  the  generic  extension 
must  be  regular  there  as  well  as  in  V,  it  follows  that  no  8 with  k < 8 < y remains 
a cardinal. 

5.  (Gitik)  The  same  as  4.,  except  that  </  is  the  relation  < defined  on  VKy 
by: 

x < y iff  iCj  and  \x\  < |y  Cl  k\  . 

(See  the  end  of  §25  for  more  on  this  relation.)  Then  forcing  with  If  has  the 
properties  described  in  4.  and  moreover  has  an  alternate  formulation  as  in  3.  See 
Gitik  [80,86]  and  Apter-Henle  [91,92]  for  applications. 

6.  I is  w and  </  the  membership  relation  on  &>.  Blass  [88]  considered  various 
possibilities  for  T and  extended  results  of  Mathias  [77]  established  using  Mathias 
forcing  (see  the  end  of  §1 1).  Woodin  and  also  Judah-Shelah  [91]  showed  assuming 
CH  how  to  get  an  T so  that  Pjf  (and  hence  an  u> i-c.c.  notion  of  forcing)  adjoins 
a real  of  minimal  constructibility  degree. 

As  a large  part  of  the  increasing  use  of  strong  hypotheses  to  establish  relative 
consistency  results,  Prikry  forcing  was  to  be  generalized  in  various  directions  for 
a variety  of  purposes.  The  possibility  of  adjoining  a new  unbounded  subset  of 
some  large  k with  minimal  disturbance  to  the  universe,  as  in  the  basic  paradigm, 
came  to  be  considered  a prominent  feature  of  large  cardinals.  In  the  decades  to 
follow,  generalizations  for  changing  cofinality  were  to  become  central  to  the  study 
of  the  possible  values  of  2X  for  singular  strong  limit  X,  and  equiconsistency  results 
were  to  be  established  using  an  elegant  and  technically  sharp  generalization  for 
adjoining  closed  unbounded  subsets  of  some  k while  preserving  its  large  cardinal 
properties,  Radin  forcing  (see  volume  II). 
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19.  Iterated  Ultrapowers 

Kenneth  Kunen  was  a student  with  Scott  at  Stanford  University  and  went  on  to 
become  a prominent  set  theorist  and  topologist  at  the  University  of  Wisconsin. 
Results  of  his  have  already  been  discussed  in  §17  and  elsewhere.  His  1968 
dissertation  [68]  is  rivaled  only  by  Silver’s  of  a couple  of  years  earlier  for  its  impact 
on  the  development  of  set  theory.  Sophisticated  and  wide-ranging,  it  featured 
elegant  results  about  inner  models  and  elementary  embeddings  that  considerably 
clarified  the  structure  theory  of  measurable  cardinals  and  0#.  This  section  develops 
the  techniques  that  led  to  these  results,  which  are  then  established  in  the  two 
subsequent  sections. 

Like  Silver,  Kunen  was  inspired  by  Gaifman’s  seminal  work.  Gaifman  had 
systematized  the  process  of  iterating  the  ultrapower  construction  into  the  transfinite 
and  revealed  its  potentialities  by  establishing  9. 1 . Kunen  realized  that  ultrapowers 
of  inner  models  can  be  taken  by  an  ultrafilter  not  necessarily  a member  of  the 
model,  and  the  process  iterated  if  an  “amenability”  condition  is  satisfied.  His 
approach  is  formulated  in  a way  that  anticipates  later  developments,  turning  first 
to  a generalization  of  the  ultrapower  construction  of  §5. 

Suppose  that  M is  a transitive  e-model  of  ZFC-;  i.e.  ZFC  with  the  Power 
Set  Axiom  deleted.  Suppose  also  that  k is  an  infinite  cardinal  in  the  sense  of  M ; 
k need  not  be  a cardinal  in  V . (To  the  end  of  this  chapter  especially  germane 
is  the  liberal  convention  that  k,  X, . . . may  denote  cardinals  only  in  the  sense  of 
some  transitive  e -model.)  Then 

U is  an  M-ultrafilter  over  k iff 

(i)  ( M , e,  U)  \=  U is  a normal  ultrafilter  over  k ; and 

(ii)  (M,  U)  is  weakly  amenable:  for  any 

F e KM  n M,  {§  < k | F(£)  e U]  e M . 

An  equivalent  formulation  of  (i)  was  used  for  a definition  before  17.6,  and  as 
noted  there,  normality  subsumes  k -completeness  in  the  sense  of  M,  and  U e M is 
not  required.  The  present  formulation  in  terms  of  (M,  e,  U)  anticipates  the  use 
of  such  structures  below,  so  it  should  be  pointed  out  that  when  M is  a proper 
class,  the  usual  caveats  about  the  formalizability  of  satisfaction  apply  (§0).  The 
new  condition  (ii)  is  a weak  form  of  Jensen’s  concept  of  amenability:  For  RCA, 
(A,  R)  is  amenable  iff  for  any  x e A,  R IT  x e A.  (ii)  imposes  this  condition 
on  (M,  U)  only  for  those  x e M of  cardinality  at  most  k in  the  sense  of  M.  It 
should  thus  be  clear  that  in  considering  (M,  e,  U),  U is  not  to  play  a role  in  the 
Replacement  Schema  for  M (=  ZFC-. 

The  current  definition  of  M-ultrafilter  over  k differs  from  the  original  one  in 
Kunen  [70]  in  two  respects:  First,  transitive  e -models  M not  only  of  ZFC  but  of 
ZFC-  are  allowed;  examples  are  Lx  and  II,  for  regular  /.  > o>  and  their  elementary 
substructures,  and  so  such  models  abound.  (On  the  other  hand,  such  an  M,  if  it 
contains  all  the  ordinals,  must  be  an  inner  model  of  ZFC  (cf.  before  3.1).)  Second, 
for  the  sake  of  simplicity  normality  is  imposed  on  U at  the  outset,  whereas  Kunen 
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had  began  only  with  k -completeness  in  the  sense  of  M.  Kunen  [70:  188]  observed 
that  for  countable  M |=  ZFC,  having  an  M-ultrafilter  over  k in  his  weaker  sense 
is  equivalent  to  k being  weakly  compact  in  M.  With  normality  Kleinberg  [78] 
established  an  analogous  equivalence  with  a stronger  large  cardinal  property  (see 
Subtle  Properties  in  volume  II).  This  property  is  also  compatible  with  V = L,  but 
in  any  case  external  measurability  with  the  weak  amenability  condition  (b)  does 
lead  to  strength  within  M,  at  least  if  M \=  ZFC. 

With  U an  M-ultrafilter  over  k,  an  ultrapower  of  (M,  e,  U)  can  be  formed: 
For  any  / e K M fi  M,  set 

(f)u  = {g  U e KM  n M a {$  < * | /(f)  = g(f )}  e U}  . 

Thus,  only  functions  in  M are  considered  and  equivalence  classes  of  these  taken. 
To  accommodate  proper  classes  M set 

(f)°u  = {g  I 8 e (f)u  A Vh(h  e (f)v  rank(g)  < rank (h))} 

as  in  §5  so  that  the  (/)y’s  are  sets.  The  domain  of  the  ultrapower  can  then  be 
formulated  as 

*M/U  = {(f)°u  | / e K M D M] 
with  its  membership  relation  Ey  defined  by 

(/)?,  Eu  (g)0v  iff  {$<K  I /(I)  e g«)}  e U . 

Furthermore,  taking  Ce(A)  to  be  the  ambient  language  of  (M,  e,U),  the  new 
interpretation  Av  of  the  unary  predicate  symbol  A can  be  defined  appropriately: 

(/)?/  Ev  Av  iff  {§  < k | m eU}eU  . 

This  motivates  weak  amenability:  that  [f  < k \ f(f ) e U}  e M makes  this 
definition  possible. 

Proceeding  as  in  §5,  a crucial  use  of  M (=  ZFC~  is  to  check  that  Los’s 
Theorem  holds  for  (K M / U , Eu)',  the  Separation  Axiom  is  needed  for  its  very 
statement,  and  the  Axiom  of  Choice  for  the  existential  quantifier  step.  However, 
Ev  may  not  be  well-founded,  as  U is  a: -complete  only  for  sequences  in  M.  If  it 
is  well-founded,  by  the  Collapsing  Lemma  0.4  there  is  a transitive  N,  a W c N, 
and  an  isomorphism 

tv.  (KM/U,Eu,Au)  ->  (N,  e,W)  . 


We  can  then  set 


[f]u=n((f)°u)  for  feKMHM 
and 


j(x)  = [fx]u  for  x e M 

where  fx  is  the  constant  function:  k ->  [a'|.  Because  of  Los’s  Theorem, 
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j:  (M,  e)  < (N,  e) 

and  so  N |=  ZFC“  and  OnflMCOnfliV,  and 

crit(j)  = k and  [id]j/  = k 

by  normality,  where  id:  «■—>•«•  is  the  identity  function.  When  M is  an  inner 
model  of  ZFC  and  U e M,  the  foregoing  is  just  the  §5  ultrapower  construction 
carried  out  in  M,  so  that  W e N c M and  N is  a definable  class  in  the  sense 
of  M.  Flowever,  it  can  well  be  that  On  DM  < On  C\N:  Suppose  that  a:  is  a 
measurable  cardinal  and  U is  a normal  ultrafilter  over  k , and  consider  M — If  . 
Then  (k  M/U,  Ev)  is  well-founded,  and  there  is  an  N and  j as  above.  Since  this 
j is  the  restriction  to  M of  the  usual  embedding  defined  on  all  of  V, 

On  O M = k+  < 2K  < j(K)  <OnOiV. 

By  taking  a countable  elementary  substructure  of  an  amalgamation  of  M,  U,  and 
N and  forming  its  transitive  collapse,  we  can  get  countable  M0,  t/0>  and  Nq  such 
that  On  D Mq  < On  fl  No. 

Parts  of  the  next  lemma  recall  5.7;  as  usual,  the  subscript  U is  suppressed. 

19.1  Lemma. 

(a)  j is  cofinal:  for  any  y e N there  is  an  x e M such  that  y e j(x). 
Moreover,  if  y is  an  ordinal,  then  x can  be  taken  to  be  an  ordinal  as  well. 

(b)  If  M is  a set,  then  \M\  — /V|. 

(c)  j{x)  — x for  eveiy  x e VKC\  M,  VKC\M  — VKC\N,  V(k)C\M  — V{k)C\N, 
and  k+m  — k+n. 

(d)  U i N. 

(e)  W is  an  N -ultrafilter  over  j(ic). 

Proof,  (a)  If  y = [/]  say,  take  x = ran(/).  If  also  y e On,  then  we  can  assume 
that  ran(/)  C On  and  take  x = sup(ran(/))  + 1. 

(b)  \N\  < | KM  fl  M\  < \M\,  but  also  \M\  < |iV|  because  of  j . 

(c)  The  rank  function  can  be  defined  in  ZFC~  by  the  usual  recursion  (0.2). 
Using  this,  we  shall  show  that  for  each  a < k, 

(*)  j(x)  — x for  every  x e Va  0 M,  and  Va  0 M — Va  n fV  . 

This  proceeds  by  induction  on  a,  the  limit  stage  being  immediate.  For  the  suc- 
cessor stage,  suppose  that  (*)  holds,  and  let  x e M with  rank(x)  = a.  Then 
rankfjC*))  = j(oi)  — a by  definability  of  rank,  so  by  induction 

j(x)  = {y  e Va  n N | y e j(x)} 

= {y  e Va  n M \ y e j(x)} 

= {y  e Va  OM  I jiy)  e j{x)} 

= {y  I y e x} 


= X . 


19.  Iterated  Ultrapowers 


247 


This  also  implies  that  Va+\ flM  c Va+i  n A'',  so  it  remains  to  establish  the  converse 
to  complete  the  inductive  proof  of  (*): 

Suppose  that  x e N with  rank(x)  = a,  say  x = [/].  Since  j(a ) = a,  it 
can  be  assumed  that  rank(/(f))  = a for  every  f < k.  Set  u — [_Jran(/)  £ M. 
Assume  to  the  contrary  that  \u\  > k in  M,  so  that  there  is  a surjection  s\  u -*  k 
with  s e M.  Let  g:  k -»  u with  g e M such  that  ,s(g(f))  = f for  every  f < k. 
Since  rank(g(f))  < a for  every  f < k,  [g]  £ Van  N = Var\M,  say  [g]  = z e M, 
so  that  j(z)  = z = [g].  But  this  is  a contradiction,  since  g through  its  definition 
must  be  injective. 

It  follows  that  |n|  < k in  M.  But  then, 

x = {yeu\{%<K\ye  /(f)}  e U}  e M . 

The  equality  holds  since  for  any  y e x,  y e Van  N = VaC\M  and  j (y)  — y by 
induction,  and  membership  in  M follows  from  the  weak  amenability  of  { M , U). 

For  the  final  parts  of  (c),  note  first  that  if  X e V(k)  fl  M,  then  j ( X ) IT  k = 
X e N.  Conversely,  if  Y e V{ k)  fl  N,  say  Y — [/],  then 

Y = {aeK\{!;<K\ae  /(f)}  £ U}  , 

which  is  in  M by  the  weak  amenability  of  ( M , U) . That  k+m  — k+n  follows,  as 
M and  N contain  the  same  well-orderings  of  k. 

(d)  Assume  to  the  contrary  that  U £ N . Then 

V(k)  C\M  — UU{k  — X | X e U}  e N . 

Since  V{k)n  = V(k)  n N — V(k)  n M by  (c)  and  j ( k ) = {[/]  | / £ D M},  it 
is  not  difficult  to  see  that 

N 1=  3a  < j(K)3y3g(y  — V{oi)  A g:  y ->  / (at)  is  surjective) 

since  we  can  take  a = k and  define  an  appropriate  g from  U £ N using  Replace- 
ment. But  then, 

M \=3a  < ic3y3g(y  — V(a)  A g:  y — > k is  surjective)  , 

and  this  is  enough  to  derive  a contradiction  by  the  argument  of  2.8.  (Alternately, 
let  h : k — > y with  h £ M satisfy  g(/z( f ))  = f for  every  f < k.  Then  [ft]  £ 
Va+i  D N — Va+i  fl  M by  (c),  and  as  in  its  proof,  the  injectivity  of  ft  leads  to  a 
contradiction.) 

(e)  To  verify  the  weak  amenability  of  ( N , W),  suppose  that  F £ n N, 
say  F — [/].  For  each  f < k it  can  be  assumed  that  /(f)  £ KM,  and  so  a 
function  f'.KXK  -a  M can  be  defined  by:  /( f,  rj)  = /(f)(??).  Since  / £ 
M,  X — {(f , r})  | /(f , rj)  £ U}  £ M by  the  weak  amenability  of  (M,  U).  Now 
define  g £ KM  by:  g(f ) = {?;  | (f , ;;)  £ X}.  Then  g £ M,  and  for  any  ft  £ KMHM, 
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[/*]  e [g]  iff  {§  < k | A($)  e g(|)}  e £/ 

iff  {§  < * | /(§)(*($))  eU}eU 
iff  F([h])  e W , 

i.e.  [g]  = {a  < j(K)  | F(a)  e W}. 

It  is  similarly  straightforward  to  show  that  ( N , e,  W)  \=  W is  a normal 
ultrafilter.  H 

The  import  of  (d)  is  that  whether  or  not  U e M,  it  is  definitely  the  case  that 
U £ N.  Assertions  like  (a)  become  pertinent  since  it  is  not  necessarily  the  case 
that  On  fl  M — On  (T  N. 

Recalling  5.6,  elementary  embeddings  also  give  rise  to  ultrafilters  in  the 
present  context: 

19.2  Exercise.  Suppose  that  M and  N are  transitive  e-models  o/Z FC_,  k:  M < N 
with  crit(A:)  = k,  and  V{k)  O M = V(k)  fl  N.  Then 

{X  e V(k)  0 M | k e k(X)} 


is  an  M -ultrafilter  over  k.  H 

The  process  of  iterating  the  ultrapower  construction  is  now  described.  Starting 
with  an  M-ultrafilter  over  k,  if  the  ultrapower  of  M by  U is  well-founded,  we  can 
define  N,  W,  and  j as  above  and  by  19.1(e)  repeat  the  process  with  N and  W, 
continuing  through  successor  stages  as  long  as  ultrapowers  are  well-founded.  At 
limit  stages  we  will  have  engendered  a directed  system  of  models  and  embeddings 
as  reviewed  in  §0,  and  so  we  can  take  a direct  limit.  The  following  observation 
shows  that  the  process  can  be  continued  if  the  result  is  a well-founded  structure. 
For  convenience,  let 

e:  (X,  e,  R)  {Y,  E,  S) 

indicate  that  (i)  e:  (X,  e)  -<  { Y , E),  i.e.  e is  elementary  for  Ce  formulas;  and  (ii) 
e also  preserves  the  predicate,  i.e.  e is  an  embedding  of  (X,  e,  R)  into  (Y,  E,  S). 
With  regard  to  (i),  recall  our  convention  emanating  from  5.1(c)  that  for  transitive 
proper  class  e-structures,  elementary  is  E\  -elementary.  For  (ii),  note  that  if  R is 
definable  in  ( X , e)  using  parameters,  then  is  equivalent  to  full  elementary 
embedding. 

19.3  Exercise.  Suppose  that  for  each  a < S,  Wa  is  an  Na-ultrafilter  over  Xa  and 
that 

(((Na,e,Wa)  \ a < 8),  (jap  \ a < 0)) 

is  a directed  system  of  < embeddings  which  has  a well-founded  limit.  Then  it  has 
a transitive  collapse  ( N , e,  W),  and  if for  a < <5,  jap.  ( Na , e,  Wa)  ^ ( N , e,  W) 
is  the  direct  limit  embedding  modulated  by  the  transitive  collapse,  then  W is  an 
N -ultrafilter  over  X — jas(Xa)  for  some  (and  so  any)  a < 8. 
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Hint.  Toward  showing  that  the  Collapsing  Lemma  0.4  is  applicable  to  the  di- 
rect limit,  note  that  it  is  set-like:  Let  (D,  E,  A)  denote  the  direct  limit  and 
ia : (Na,e,Wa)  — > ( D,E,A ) the  corresponding  embeddings  for  a < 8.  Sup- 
pose that  a e D,  so  that  for  some  a < 8 there  is  a b e Na  such  that  ia(b ) = a. 
Then  for  any  x e D, 

x E a iff  3/33y(a  < /3  < 8 A ip(y ) = x A y e jap(b))  , 

and  so  {r  e D | x E «}  is  a set  by  Replacement. 

For  the  latter  assertion,  the  verification  of  weak  amenability  is  typical:  Sup- 
pose that  F e XN  fl  N,  so  that  F — ja$(F ) for  some  a < 8 and  F e k“Na  n N„. 
By  the  weak  amenability  of  (Na,  Wa ), 

x = {§  < Xa  I F(f)  e Wa}  e Na  , 

and  hence 

jafX)  = {§<11  F(§)  e W}  e N . A 

For  any  M-ultrafilter  U over  k,  we  can  thus  proceed  recursively  to  define 
Cfz(A)  structures  (Ma,  e,  Ua)  for  a < r where  U„  is  an  Ma -ultrafilter  over  tca, 
and  embeddings  iap:  { Ma , e,  Ua)  (Mp,  e,  Up)  for  a < ji  < r as  follows: 
Set  Mo  = M,  Uq  — U,  kq  — k,  and  too  the  identity  on  M.  Having  defined 
Ma,  Ua , Ka , and  iap  for  a < fi  <8,  there  are  two  cases: 

(i)  8 is  a successor  ordinal,  say  8 — y + 1 . If  the  ultrapower  of  My  by  Uy  is 
well-founded,  let  Ms  be  its  transitive  collapse  and  Us  c Ms  such  that 

j:  ( My,e,Uy ) ■<-  ( Ms,e,Us ) 

is  the  corresponding  embedding.  Set  ks  = j(Ky),  iys  — j,  ias  = j°iay  fora  < y, 
and  iss  the  identity  on  Ms.  If  on  the  other  hand  the  ultrapower  is  ill-founded,  set 
8 — x. 

(ii)  8 is  a limit  ordinal.  If  the  direct  limit  of 

e,  Ua)  | a < 8),  (iap  | a < fi)) 

is  well-founded,  let  Ms  be  its  transitive  collapse  and  Us  c Ms  such  that  for  each 
a < 8 there  is  a direct  limit  embedding: 

(Ma.e,Ua)  -<-  (Ms,  e,  Us) 

modulated  by  the  transitive  collapse.  Call  this  embedding  ias  for  a < 8,  set 
ks  = ias(Ka)  for  some  (and  hence,  any)  a < 8 and  iss  the  identity  on  Ms.  If  on 
the  other  hand  the  direct  limit  is  ill-founded,  set  8 — r. 

If  this  definition  proceeds  through  all  the  ordinals,  set  r = On.  Otherwise, 
r e On  denotes  the  least  stage  at  which  the  iteration  process  encounters  ill- 
foundedness.  We  could  have  trudged  on  through  ill-founded  structures  with  only 
notational  complications,  but  such  toil  would  be  gratuitous  for  present  purposes. 
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To  summarize  the  notation, 

(Ma,  Ua,  Ka,  iap)a<peT  is  the  iteration  of  (M,  G,  U)  ; 

r is  the  length  of  the  iteration;  and  for  a G r,  (Ma,  G,  Ua)  is  an  iterate,  specifically 
the  ath  iterate,  of  (M,  e,U).  Also, 

(M,  e,U)  is  iterable  (and  U is  an  iterable  Af -ultrafilter)  iff  r = On  . 

Concerning  formalizability,  note  that  even  if  M were  a proper  class,  the  iteration  of 
(M,  G,  U)  can  be  regarded  as  a single  class  definable  from  M and  U.  In  particular, 
if  M is  an  inner  model  of  ZFC  and  U G M,  then  the  iteration  is  definable  entirely 
in  M. 

Iterability  is  a substantial  hypothesis,  one  that  will  soon  be  considered  in 
detail.  But  for  now,  we  fix  (M,  g,  U)  and  its  iteration  as  given  above,  and  proceed 
through  several  lemmata  to  establish  some  basic  properties. 

19.4  Lemma.  Suppose  that  a < f>  G r. 

(a)  crit(iap)  = Ka  and  iapiKc)  — Kp. 

(b)  iap(x)  = x for  every  x G VKa  IT  Ma,  VKa  D Ma  — VKa  IT  Mp,  and  V(jca)  IT 
Ma  = V(Ka)  n Mp. 

(c)  If  f is  a limit  ordinal,  then  Kp  — sup({Ary  | y < f}). 

(d)  If  M is  a set,  \Ma\  = \M\  ■ |ar|. 

Proof  (a)  follows  directly  from  the  construction,  and  (b)  is  a consequence  of 
19.1(c).  For  (c),  if  £ < Kp,  then  by  definition  of  direct  limit  § = iyp(f)  for  some 
y < f and  £ < ky.  Hence,  § = § < ky. 

For  (d),  \Ma\  < \M\  ■ |a|  follows  by  induction  on  a,  using  19.1(b)  and 
bounding  direct  limits.  Conversely,  M is  injectible  into  Ma  and 
{ky  | y < a}  c Ma,  and  so  \M\  ■ |a|  < \Ma\.  H 

For  any  a+ler  and  X e V(Ka)  T Ma, 

X e Ua  iff  {?  | ? e T)  e iff  Ka  e ;'Q!ia.+i(A) 

by  normality.  This  leads  to  a further  characterization  at  limit  ordinals  crucial  to 
later  definability  results  - another  testimonial  to  the  efficacy  of  the  concept  of 
normality. 

19.5  Lemma  (Kunen  [68, 70]).  Suppose  that  f is  a limit  ordinal  less  than  r.  Then 
for  any  X e V(Kp)  T Mp, 

X e Up  iff  3 a < P({ky  I a < y < c A)  . 

Proof  Suppose  that  y < f and  X = iYp(X)  for  some  X e V(ky)  T My.  Then 

X e Up  iff  X e UY 

iff  Ky  G iy  y_|_l(X) 

iff  Ky  G lypiX)  = X , 


19.  Iterated  Ultrapowers 


251 


since  iY+\^(jcY)  = ky.  Thus,  for  any  y < fl,  if  X e ran (iyp),  then  X e Up  iff 
ky  e X.  This  suffices,  by  the  definition  of  direct  limit.  H 

Thus,  Up  is  generated  by  the  closed  unbounded  set  {ky  \ y < f}  c Kp,  and 
so  in  particular,  this  set  cannot  belong  to  Mp.  The  following  result  provides  a 
representation  of  iterated  ultrapowers  that  generalizes  5.13(a)  and  plays  a basic 
role  in  the  structure  theory. 

19.6  Lemma.  For  any  a e r and  x e Ma,  there  are  n e co,  f e ^"M  D M,  and 
Y\  < ...  < yn  < a such  that  x = «oa  (/)(*>, , . . . , kyJ. 

Proof.  Proceed  by  induction  on  a.  Suppose  that  the  proposition  holds  for  a, 
a + 1 e r,  and  x e Ma+\.  By  5.13(a),  x = i«,a+i  (glfa)  for  some  g e K“ManMa, 
so  we  are  done  if  a — 0.  Otherwise,  by  induction  g — ioa(h)(KYl, . . . , KVn)  for 
some  n e to,  h e W'M  D M,  and  y\  < . . . < yn  < a,  where  it  can  be  assumed  that 
ran(70  consists  of  functions.  Define  / e W"+‘m  n M by:  f(f\, , fn+ 1 ) = 
hif\ |«)(|„+i).  Then: 

fo.o'+l  (/”)  • i Ky,i’  tC(f)  — fo.a+l  (h ) (/Cyj , . . . , Kyn  ) (/Cq.) 

— fo'.o'+l  (ioa(h)  (kYi  , . . . , Kyn  ))  (/Cq.) 

= fo'.o'+l  (g)  (^a) 

= X . 

If  S < t is  a limit  ordinal  and  x e Mg,  then  x = ias (x)  for  some  a < 8 and 
x e Ma.  By  induction  x — ioa(f)(Kyi,  ■ ■ . , kYii)  for  some  n e a>,  f e n M, 
and  y\  < ...yn  < a.  Hence,  v = ios{f)(KYl, . . . , KYn)  since  ias(KYi ) = KYi  for 
1 < i < n.  H 


Ma  is  thus  generated  in  a concrete  fashion  from  ran(/off)  and  {ky  \ y < a}. 
This  leads  the  following  corollary  on  the  action  of  the  ioa’s  on  ordinals;  the 
cardinals  are  in  the  sense  of  V as  usual. 

19.7  Corollary. 

(a)  If  f € On  fl  M and  a e x,  then 

iteG)  < (r§  n Ml  • H)+  . 

(b)  If  v is  a cardinal  such  that  fl  M\  < v e r,  then 

kv  = iov(K0 ) = v . 

(c)  If  6 is  a cardinal,  M \=  rZFC  A 6 is  a strong  limit  of  cofinality  > if, 
and  a < min ((9,  r),  then 


ioa(0)  — 9 . 
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Proof,  (a)  This  follows  from  cardinality  considerations,  since  19.6  implies  that 
rj  < ioc(f)  iff  r)  — ioa(f  )(KYl , . . . , kyJ  for  some  n e a>,  f e fl  M,  and 
yi  < ...  < y„  <a. 

(b)  We  have 

v < kv  = sup({K„  | a;  < v})  < sup({(|*/r  fl  M | • |a|)+  | a < v})  < v . 

The  equality  follows  from  19.4(c),  and  the  middle  inequality  from  (a)  above. 

(c)  It  suffices  to  show  assuming  r]  < iOa(0)  that  rj  < 6.  By  19.6,  ;;  = 
ioct(f)(Kyi,  ■ ■ ■ , Ky„)  for  some  n e co,  f e ^"6C\M,  and  Y\  < ■ ■ ■ < Yn  < a.  Since 
cf(0)  > k in  M,  there  is  a % <0  such  that  / e Hence, 

fl  <i0a^)  <(\^nM\-\a\)+  <6  . 

The  second  inequality  is  from  (a),  and  the  third  follows  from  the  hypotheses  on 
6.  H 


19.6  veers  toward  Kunen’s  original  formulation  of  iterated  ultrapowers;  the 
beginnings  of  that  formulation  are  broached  as  part  of  this  development:  For 
n e co,  set 

Un  = {X  e V([k]”)  DM  \3H  e U([H]n  c X)}  . 

Thus,  U"  is  a (definable)  class  in  the  sense  of  (M,  e,  U).  The  normality  of  U 
together  with  weak  amenability  shows  that  this  provides  the  appropriate  notion  of 
u-fold  product: 


19.8  Exercise. 

(a)  For  n e co  and  X e V([K~\n+l)  fl  M. 

X e Un+l  iff  {s  e M"  | {§  < k | s U {f } e X}  e U)  e Un  . 

(b)  For  0 < n e co, 

( M , e,  U)  1=  Un  is  a k -complete  ultrafilter  over  [/c]"  . 

Hint.  Verify  (a)  along  the  way  to  (b);  Rowbottom’s  7.17  holds  for  U and  those 
/:  [k]"  — >•  2 such  that  / e M since  U is  an  M -ultrafilter.  H 


Temporarily  liberalizing  conventions,  for  0 <n  e o>  we  can  proceed  to  form 
the  ultrapower 

[K]"M/Un  = {(/)“,„  | / e [K]"M  D M} 
and  a corresponding  embedding 

k„:  ( M , e,  U)  {l [K]"M/Un , Ev„,  Av. ) 


given  by 

Kix)  = (fxfijn  for  x e M 
where  fx  is  the  constant  function:  [k]"  ->  {v}. 
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The  next  result,  together  with  the  representation  19.6,  provides  a reduction  of 
the  satisfaction  relation  for  an  iterate  to  that  of  (M,  e,  U),  and  shows  in  particular 
that  the  ultrapowers  just  defined  are  isomorphic  to  the  finite  iterates  of  { M , e,  U). 
Note  that  U°  = {0}  by  definition.  For  proper  classes  M,  (a)  below  and  the 
coming  corollary  should  be  taken  to  be  schemas  of  theorems,  one  for  each  class 
and  formula. 

19.9  Lemma. 

(a)  For  any  formula  cp(v  o, . . . , v„ ) of  Ce,  x e M,  and  y\  < . . , < y„  < a e r, 

( M a 5 \—  ^[ioo'(v),  Kyx  , . . . , Kyfi 

iff  (M,  e,  U)  \=  {{§1,  ...,&}  e [Kf  I <p[x,  ?„]}  e Un  . 

(b)  For  new  with  0 < n e x,  n„:  (^"M/Un,  Ev AVn)  — »•  (M„,  e,  U„)  given  by 

n n((f)°u»)  = ion(f)(KO,  • • ■ , K„- 1) 

is  a well-defined  isomorphism.  Flence,  it  is  the  collapsing  isomorphism  onto  the 
transitive  collapse.  Moreover,  io„  = nn  o kn. 

Proof,  (a)  Proceed  by  induction  on  n;  the  case  n — 0 is  immediate.  Generally, 
{Ma,  e)  |=  (p\_ioa{xf  K) /j , . . . , iCyn ] 

iff  {My„+ u e)  j=  <p[«o,y„+i(x),  *v„] 

# {My„,e,  Uyfi  |=  {£  < U UYn  I <P[*0y„(*)>  Kyi’---’  KYn- 1.  ?]}  e ^ 

iff(M,e,U)  \=  6,-i}e[«r’  I 

{£  < U t/  I «!>[*,  £»-i,  $]}  e £/}  e C/"-1 

iff  (M,  e,  £/>  N {{?i,  e M"  | ?[*,  ft, ... , $„]}  e t/”  . 

(Of  course,  1J  UYn  is  kYh  and  (J  U is  k.) 

(b)  This  is  an  application  of  (a)  and  19.6  with  a — n.  H 

19.8(a)  suggests  the  general  notion  of  product  ultrafilter.  In  general  model- 
theoretic  terms,  that  finite  iterations  of  ultraproducts  can  be  rendered  as  a single 
ultraproduct  by  a product  ultrafilter  had  been  observed  by  Frayne-Morel-Scott  [62], 
For  technical  reasons  obviated  by  the  representation  19.6  for  normal  ultrafilters, 
Kunen  carried  through  a similar  analysis  into  the  transfinite,  representing  all  iter- 
ates as  single  ultrapowers  of  sorts.  Independently,  Keisler  also  developed  such  a 
representation  (Chang-Keisler  [90:  6.5]). 

The  following  immediate  corollary  of  19.9(a)  shows  how  iterated  ultrapowers 
lead  to  indiscemibles.  This  insight  of  Kunen’s  led  to  a new  structural  understand- 
ing of  0#  (21.1). 

19.10  Corollary  (Kunen  [68,  70]).  For  any  (p{v o,  . . . , vn)  of  C€, 
x e M,  yi  < . . . < yn  < a e r,  and  <5 1 < . . . < 8n  < ft  e r, 
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{Ma,  e)  \=  (p[i0a(x),  Kn,...,  ky J iff  (Mp,  e)  |=  (p[i0p(x),  kSi,...,  ks  J . 

In  particular,  {/cY  y < a]  is  a set  of  indiscernibles  for  (Ma,  e,  ioa(x))x€M- 

H 


Iterability 

Continuing  in  terms  of  (M,  e,  U)  and  its  iteration  (Ma,  Ua,  Ka,  iap) a<pex,  several 
characterizations  of  iterability  are  established  next.  For  convenience,  On  is  treated 
as  an  ordinal  when  discussing  lengths  of  iterations.  For  example,  r > an  is 
understood  to  include  the  possibility  r = On.  Indeed,  the  historically  first  result 
about  iterability,  due  to  Gaiffnan,  was  the  assertion  that  r = On  iff  x > co\. 
This  is  established  as  part  of  a larger  framework,  one  that  incorporates  a sufficient 
condition  due  to  Kunen: 

U is  countably  complete  iff  for  any  {X„  \ n e co]  c U,  Xn  0 . 

This  suffices  for  the  forward  direction  of  5.3;  {Xn  \ n e to}  may  not  be  a member  of 
M,  and  for  that  reason  requiring  P|);  Xn  e U as  in  co\ -completeness  would  be  too 
stringent.  Kunen  showed  that  this  external  completeness  implies  iterability.  For 
later  characterizations,  the  implication  is  factored  through  a property  attributable 
to  Jensen  and  implicit  in  Dodd  [82:  65]: 

{ M , e,  U)  is  countably  iterable  iff  for  any  fV-ultrafilter  W with 
(N,  e,  W)  countable  and  embeddable  into 
(M,  e,  U),  the  length  of  the  iteration  of  (N,  e,  W ) is  > oj\  . 

19.11  Lemma  (Jensen).  If  U is  countably  complete,  then  ( M , e,  U)  is  countably 
iterable. 

Proof.  Suppose  that  W is  an  N -ultrafilter,  ( N , e,  W)  is  countable,  and 
e:  ( N , e,  W)  (M,  e,  U)  . 

Letting  (Na,  Wa,  Xa,  jap)a<p€o  be  the  iteration  of  (N,  e,  W)  it  must  be  verified 
that  a > co i . To  do  this,  proceed  by  induction  on  a < u>\  to  establish  that: 

(i)  (Na,  6,  Wa)  is  defined,  and  for  inductive  purposes  , 

(ii)  there  is  an  ea\  ( Na , e,  Wa)  ■< ~ ( M , e,  U)  such  that 
ey  — ea  o jya  for  y < a . 

Initially  set  eo  — e.  For  the  induction  argument  at  a + 1,  suppose  that  (i) 
and  (ii)  have  been  satisfied  through  stages  < a.  In  particular,  for  any  X e Wa, 
ea(X)  e U.  Also,  since  N is  countable  and  a < a>\,  Na  is  countable  by  19.4(d). 
Hence,  there  is  an  ;/  e Pl{e„(Z)  | X e Wa}  by  the  countable  completeness 
of  U.  Now  r)  defines  an  embedding  of  the  ultrapower  (Xa Na/Wa,  EWa,  AWJ 
of  (Na,e,Wa)  into  ( M,e,U ) by:  j((f)°Wa)  — ea(f){n).  Moreover,  j is  £e 
elementary:  Using  a one-variable  formula  tp(v o)  for  simplicity, 
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(■ KNa/Wa,EwJ  \=<p[(f)°wj 

iff  {f  < | {Na,  e)  \=  <p[fm]  e 

iff  rjetf  <k  | {M,e)\=q>[ea(mm 
iff  (M,e)^<p[ea(f)m- 

But,  ( KNa/Wa , EWa,  AWa)  is  well-founded,  being  embeddable  into  a transitive 
structure,  and  so  {Na+ 1,  G,  Wa+ 1>  is  duly  defined  as  its  transitive  collapse.  Con- 
sequently, j modulated  by  the  collapsing  map  yields  an  embedding 


£a+l-  {Noi+l,  Wa+l}  “<  {M , G,  U)  ■ 


Furthermore,  it  is  simple  to  check  that  ea  — ea+i  o ja.a+i,  and  hence  that  for  any 

y <C  (X,  Cqi  o jya  6a- (-1  O jyta+l- 

Suppose  now  that  8 < co\  is  a limit  ordinal  and  (i)  and  (ii)  have  been  satisfied 
through  stages  < 8.  In  particular,  the  direct  limit  of 

{{{N„,  e,  Wa)  | a < 8 },  {jap  | a < f)) 

is  embeddable  into  (M,  e,  U)  by  0.7  because  of  the  efs.  Hence,  this  direct 
limit  is  well-founded,  and  so  {N$,  e,  W$)  is  duly  defined  as  its  transitive  collapse. 
Moreover,  that  embedding  of  the  direct  limit  into  { M , e,  U)  modulated  by  the 
collapsing  map  yields  an  embedding 


es:  {Ns,  e,  Ws > (M,  e,  W) 


such  that  eY  — es  o jyS  for  y < 8.  H 

19.12  Lemma.  If  ( M , G,  U)  is  countably  iterable,  then  it  is  iterable. 

Proof.  Assume  to  the  contrary  that  r G On.  If  r is  a successor,  say  r = 
y + 1,  then  temporarily  let  (Afr,  Ex,  Ur)  be  the  ultrapower  {** MY/Uy,  EUy,  AUy) 
of  {My,  g,  Uy),  and 

ip  ( M,e,U ) -<-  (Mt,Et,Ut) 

the  composition  of  ioY  followed  by  the  natural  embedding  into  that  ultrapower. 
If  r is  a limit  ordinal,  let  {Mz,  Er,  UT)  be  the  direct  limit  of  the  iteration  of 
(M,  G,  U),  and 

ip  (M,  G,  U)  -<-  {Mr,  Er,  Ur) 

the  corresponding  embedding.  In  either  case  (Mr,  Ez,  Uz)  is  ill-founded  by  as- 
sumption. Nonetheless,  a look  at  the  inductive  proofs  of  19.6  and  19.9(a)  shows 
that  they  also  hold  “at  r”,  i.e.  with  a = r,  {Ma,  G,  Ua)  replaced  by  (Mz,  E z,  UT), 
and  i()a  replaced  by  iT  - if  in  the  case  r = y + 1,  ky  is  replaced  by  [id]^  where 
id:  ky  —*■  ky  is  the  identity.  This  is  how  19.6  and  19.9(a)  are  applied  “at  r”  in 
what  follows. 
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Let  [xk  | k e co}  c.  Mz  such  that  Xk+\  Ex  xk  for  each  k e co.  By  19.6  “at  r”, 

Xk  — ix{fk)(.Ky*,  • • ■ , Kykt ) for  some  n(k)  G co  , 

fk  e [K]'mM  n M,  and  Y\  < ■ ■ ■ < Yn(k)  < r . 

Take  a Skolem  hull  of  [fk  \ k g co}  in  (M,  e,U),  and  then  its  transitive  collapse. 
This  produces  a countable  (N,  G,  W)  where  W is  an  IV -ultrafilter,  and  the  inverse 
of  the  collapsing  map  is  an  embedding 

e:  (N,  e W)  -<~  (M,  g,  U)  . 

Let  fk  satisfy  e(fk)  = fk  for  each  k e a>.  By  hypothesis,  the  iteration 
(Na,  Wa,  ka,  jap)a<fiea  of  {N , G,  W)  satisfies  a >(o\. 

Let  £ < co i be  the  ordertype  of  the  set  Z = {y^  \ k G w A 1 <m<  n(k)} 
and  h:  Z ->  f the  unique  order-preserving  function.  For  each  k G co,  set 

<5*  = for  1 < m < n(k)  , and 

xk  = J0f(/t)(^,  ■ • • , • 

Furthermore,  let  <pk(vo,  v\, . . .)  be  a formula  such  that  <pk[iT(fk),  h(fk+ 1)- • ■ •] 
asserts  that  v>+ 1 Ez  Xk  in  {Mr,  Er,  Ur),  where  . . . denotes  the  listing  of  the  set 
{i<vk, . . . , Kvk  , km  , . . . , Kt+ 1 } in  increasing  order.  Finally,  let  <pf(  vq,  i>i)  be  the 

' 1 ' n(k)  'I  'n(k+ 1) 

formula  to  the  right  of  (M,  G,  t/)  ^ in  the  application  of  19.9(a)  “at  r”  to  <pk. 
Then  for  each  k e co, 

Xk+ 1 Ez  xk  iff  ( M , G,  t/)  |=  %*[/«■-  /*+i] 
iff  (N,G,W)^<p*k[fk,Jk+1] 
iff  xk+ 1 e , 

by  using  (pk  appropriately  in  (N$,  G,  Wk).  This  of  course  is  a contradiction,  and 
so  the  proof  is  complete.  H 

19.13  Corollary  (Kunen  [68,70]).  If  U is  countably  complete,  then  ( M , G,  U)  is 

iterable.  ~\ 

The  converse  is  false,  since  there  are  iterable  (N,  G,  W)  that  are  countable 
(19.15(e))  in  which  case  W cannot  be  countably  complete.  Flowever,  we  do  have 
the  following: 

19.14  Lemma.  If  ( M , G,  U)  is  iterable,  then  it  has  an  iterate  ( Ma , G,  Ua)  such 
that  Ua  is  countably  complete. 

Proof  This  is  affirmed  with  a — cop.  Suppose  that  {Xn  \ n G co]  c UMl . By  19.5, 
for  each  n G co  there  is  an  an  < co\  such  that  {ky  \ a„  < y < cof  c Xn.  Let 
f>  = sup({a„  | n G oo}).  Then  tcp  G p|„  xn-  H 


19.  Iterated  Ultrapowers 


257 


The  arguments  for  these  lemmata  can  now  be  used  to  concoct  a daisy  chain 
of  implications  that  provides  several  characterizations  of  iterability: 

19.15  Theorem  (Gaifman  for  (b),  Jensen,  Dodd  [82]).  The  following  are  equiva- 
lent: 

(a)  (M,  G,  U)  is  iterable,  i.e.  r — On. 

(b)  x > co\. 

(c)  There  is  an  iterate  (Ma,  e,Ua)  such  that  Ua  is  countably  complete. 

(d)  ( M , G,  U)  is  countably  iterable. 

(e)  Any  { N , G,  W)  -<~  embeddable  into  ( M , G,  U)  is  iterable. 

Proof,  (a)  -»  (b)  is  immediate.  For  (b)  ->  (c),  note  first  that  r > co\  implies  that 
r > coi,  i.e.  there  is  an  a)\ -iterate,  since  the  corresponding  direct  limit  is  readily 
seen  to  be  well-founded.  Hence,  (c)  follows  by  the  argument  for  19.14.  For  (c) 
->  (d),  note  that  ( Ma , g,  Ua)  is  countably  iterable  by  19.11,  and  so  (M,  G,  U)  is 
countably  iterable  since  it  is  embeddable  into  (Ma,  G,  Ua).  For  (d)  — »■  (e),  note 
that  ( N , g,  W)  is  countably  iterable,  and  use  19.12.  (e)  — >•  (a)  is  immediate.  H 

The  following  absoluteness  result  is  a consequence: 

19.16  Proposition.  Suppose  that  S is  a transitive  e-model  of  ZFC,  o>\  c S,  and 
M,  U C S are  classes  in  the  sense  of  S such  that  S |=  U is  an  M-ultrafilter.  Then 

(M,  g,  U)  is  iterable  iff  S \=  ( M , g,  U)  is  iterable  . 

In  particular,  if  U is  an  M-ultrafilter,  M |=  ZFC,  and  co\  U {t/}  C M,  then 
( M , G,  U)  is  iterable. 

Proof.  Clearly  U really  is  an  M-ultrafilter,  and  in  particular  M really  models 
ZFC'  - assertions  implicit  in  the  conclusion.  The  hypotheses  further  imply  by 
absoluteness  of  satisfaction  that  for  any  a < co\,  (( Ma , G,  Ua))s  = { Ma , G,  Ua) 
when  both  are  defined.  Moreover,  for  class  structures  in  the  sense  of  S,  well- 
foundedness  is  absolute  (0.3).  Hence,  if  S |=  (M,  G,  U)  is  iterable,  then  the  (real) 
iteration  of  (M,  G,  U)  has  length  > a>\,  and  so  by  19.15  it  is  iterable.  The  converse 
is  clear. 

The  last  assertion  follows  from  taking  S = M and  noting  that  M |=  U is 
countably  complete.  H 

A Lemma  on  Measurable  Cardinals 

Tucked  in  next  is  a striking  result  of  Kunen’s,  a dividend  of  his  work  on  iterated 
ultrapowers.  Chang  [71]  formulated  for  cardinals  k the  class  CK  of  sets  con- 
structive using  the  infinitary  language  LKK  (described  in  §4)  and  observed  various 
generalizations  of  properties  of  L — C"‘ . For  example,  Ck  is  the  least  inner  model 
M closed  under  the  taking  of  arbitrary  <k -sequences,  i.e.  <KM  c M.  Also, 
Scott’s  result  5.5  can  be  extended  to  show  that  if  there  is  a measurable  cardinal  k. 
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then  V / CK+ . Sureson  [86]  studied  transcendence  over  these  models  in  terms  of 
indiscemibles,  and  showed  in  particular  that  transcendence  over  C®1  is  equicon- 
sistent  with  the  existence  of  a measurable  cardinal.  Using  iterated  ultrapowers 
Kunen  [73]  showed  that  unlike  L these  models  do  not  generally  have  intrinsic 
well-orderings:  if  k is  regular  and  there  are  k+  measurable  cardinals  then  the 
Axiom  of  Choice  fails  in  CK+ . Toward  this  result  he  established  a striking  lemma 
that  revealed  an  unexpected  global  constraint  on  measurable  cardinals  imposed  by 
ZFC.  Here,  a proof  due  to  William  Fleissner  is  provided  that  does  not  use  iterated 
ultrapowers.  As  usual,  if  k is  measurable  and  U is  a k -complete  ultrafilter  over 
k , then  jij:  V -<  Mu  = Ult(F,  U)  and  [f]u  e Mv  corresponds  to  /:«■—>■  V. 

19.17  Theorem  (Kunen  [73]).  For  any  j,  the  following  set  is  finite: 

{k  > co  | there  is  a k -complete  ultrafilter  U over  k such  that  H < JuG)}  ■ 

Proof  (Fleissner  [75]).  First,  stipulate  for  U a v-complete  ultrafilter  over  k > co 
that  a half-open  interval  of  ordinals  [a,  P)  = {§  I a < £ < P)  is  a moving  internal 
for  U iff  ju  fixes  cofinally  many  ordinals  less  than  a as  well  as  P,  yet  it  moves 
every  ordinal  in  [a,  /?).  Note  that  if  these  conditions  are  satisfied,  then 

(i)  cf(a)  = k , and 

(ii)  p = sup({j"(f)  | n e co})  for  any  f e [a,  P)  , 

where  j”,  indicates  n iterative  applications  of  ju-  (For  (i),  sup({jufi)  \ rj  < a})  = 
a since  cofinally  many  ordinals  less  than  a are  fixed  by  ju,  so  a standard  argument 
shows  that  ju(oi)  > a necessitates  cf(a)  — ic.) 

Next,  observe  that  if  U is  a ^-complete  ultrafilter  over  k > co  and  W is 
a /(.-complete  ultrafilter  over  X > k,  then  jv  and  jw  commute:  For  any  set  x, 
ju(x ) = {[f]u  I / € Kx}  and  since  jw  fixes  k and  U, 

jwiju(x))  = {[/];/  | / e K jw(x)}  — juijwixj)  ■ 

It  follows  that  with  U and  W as  above  if,  [a,  P)  is  a moving  interval  for  U and 
[a,  P)  is  a moving  interval  for  W,  then  either  they  are  disjoint  or  strictly  nested, 
i.e.  if  a < a < p,  then  p < p.  To  show  this,  first  choose  an  ?;  such  that 
a < i]  < a and  jw(>l)  — hi  this  is  possible  by  hypothesis  on  a.  By  (ii)  above, 
there  is  an  n e co  such  that  a < j^fi ) < p.  Since  ju  and  jw  commute, 

JwUuW)  = JaUwm  = j'Pfi)  ■ 

Hence,  P < jufi)  since  [a,  P)  is  a moving  interval  for  W,  and  so  P < P- 

The  proof  can  now  be  completed:  Assume  to  the  contrary  that  for  some  £ 
there  is  an  increasing  sequence  (at,  i £ op  of  measurable  cardinals  such  that 
for  each  i e co  there  is  a k,  -complete  ultrafilter  If  over  k,  with  £ < jufH)-  It 
follows  that  for  each  i e co,  ^ is  in  a moving  interval  [a,-,  /?,)  for  By  (i)  above, 
i / j implies  that  a,-  / aj,  so  that  we  can  assume  by  taking  a subsequence  that 
i < j implies  that  a,;  < aj.  But  then,  as  the  [a, , /!,■)’ s all  contain  |,  they  are 
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nested  intervals,  and  so  /So  > P\  > Pi  ■ ■ ■ is  a descending  sequence  of  ordinals. 
Contradiction!  H 

Friedrich  Wehrung  [89]  gave  another  proof  of  Kunen’s  result  and  established 
related  results. 

A Connection  with  Prikry  Forcing 

This  section  is  concluded  by  establishing  an  interesting  connection  between  iterated 
ultrapowers  and  Prikry  forcing.  Suppose  that  M is  an  inner  model  of  ZFC,  U £ M, 
and  M \=  U is  a normal  ultrafilter  over  k.  Then  (M,  e,U)  is  iterable  by  19.16,  so 
let  {Ma,  Ua,  Ka,  iap)a<pe On  be  its  iteration.  Then  for  any  y,  ioy(U)  = UY  £ My, 
and  by  absoluteness  of  satisfaction  {{Ma,  £,  Ua)  \ y < a e On)  is  definable  in 
{My,  £,  UY)  as  its  sequence  of  iterated  ultrapowers.  In  particular,  a < /}  implies 
that  Mp  c Ma. 

For  limit  ordinals  S > 0,  set 

Mt  = a <sMa . 

Then  Ms  c Mi,  and  Mi  is  an  inner  model',  in  terms  of  our  formulation  before 
3.1, 

u Mi  c Mi  and  V£(def(M+  n Vf)  c M^+)  . 

For  the  latter,  if  x e def(M^  fl  Vj)  for  some  then  for  any  a < 8,  by  definability 
and  absoluteness  of  M/  and  rank  in  Ma  we  have  x e Ma,  and  hence  x e 

n a<sMa  = Mi. 

Focusing  on  S — ca,  note  that  {Kn  \ n e w}  e M+  — Mw:  For  each  i e co, 
{k„  ;<«£&>}£  Mj  because  of  the  definability  of  the  iteration  process  from 
Mi  on  within  Mj,  and  so  {k„  \ n £ co}  £ Mj  as  it  is  only  a finite  modification. 
However,  {k„  \ n e co}  ^ Ma  since  Km  = sup({/r„  | n £ «})  is  measurable  in  M01. 
Solovay  noted  that  {k„  \ n e co}  is  in  fact  Prikry  P^-generic  over  Mw;  Bukovsky 
[73]  observed  on  general  grounds  that  M+  is  a generic  extension  of  Mm;  and  then 
Dehornoy  and  Bukovsky  made  the  exact  connection: 

19.18  Theorem. 

(a)  (Solovay)  {k„  \ n £ w]  is  Prikry  PUai-generic  over  Mm. 

(b)  (Dehornoy  [75,78];  Bukovsky)  Mw[{k„  \ n e «}]  = M+. 

Proof,  (a)  This  is  a direct  consequence  of  18.7  and  19.5. 

(b)  Set  g = {k„  | n £ co};  it  remains  to  establish  that  M+  C Mc0[g\.  First, 

(*)  if  x £ Ma,  then  for  sufficiently  large  n £ co,  iIWJ(x)  = i^^ix)  . 

This  is  a consequence  of  the  following  for  sufficiently  large  n £ co:  There  is  a 
x„  £ Mn  such  that  i„M(x„)  = x,  i.e.  ( Mn , £,  U„)  ^ rthe  image  of  x„  under  the 
embedding  into  the  &>th  iterate  is  x\  Applying  ina),  { Mw , e,  Uf)  [=  rthe  image  of 
inm(Xn)  under  the  embedding  into  the  coth  iterate  is  ina>{x)\  But  as  the  embedding 
in  question  is  f .j . r,j 2 ■ i/vw (x ) — i(,>,<n2 ( ituo(xn ) ) — i<»,o>2 ) . 
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Next,  every  set  of  ordinals  in  M+  is  also  in  M0,[g\.  Suppose  that  x e 
V(y)  D M+.  By  19.6  and  definability  there  is  a sequence  (/„  | n e to)  e M 
such  that  for  each  n e to,  /„:  [/c]"  ->  M and  x = io«(/«)(^o>  ■ • • » k„~ i),  so  that 
i„w(x)  — iou,(fn)(K0, ... , Kn- 1)  as  in0J(Kj)  — Ki  for  i < n.  Since 

{ioco(fn)  I n eco)  — iocodfn  I « € tt>»  € M w , 

it  follows  by  definability  from  g that  (ino)(x)  \ n e to)  e Mm[g ].  Now  for  any 
£ < y,  | e x iff  for  sufficiently  large  n e to,  iim  (£)  e inco(x)  iff  for  sufficiently 
large  n e to  , l(o,(o 2 ^ hioj  Or ) by  t'i- ) above.  Bence,  by  the  definability  of  i(o,o) 2 
in  it  follows  that  x e M<y[g]. 

Finally,  M+  c MM[g]  can  be  established  by  induction  on  rank:  Suppose  that 
x e M+  and  inductively,  x c Mm[g\.  Then  x c ( Frank and  since  M(„[g] 
satisfies  AC  (being  a generic  extension  by  (a)  of  a model  of  AC),  let  / e M,,\g] 
be  an  injection:  ( Vrank(x))M“^]  — >■  On.  Then  / e M+  o Mm[g],  and  /“x  is  a 
set  of  ordinals  in  M+.  Hence,  /“x  e Mm[g]  by  the  previous  paragraph,  and  so 
x = f~Hf“x)  e M[0[g].  H 

This  last  is  part  of  the  argument  for  showing  that  if  two  transitive  e-models 
No  and  N i of  ZF  have  the  same  sets  of  ordinals  and  at  least  one  satisfies  AC,  then 
No  = N i (see  Jech  [03: 1 96ff] ). 

Dehomoy  [76, 78]  completely  described  the  inner  models  M j1"  in  terms  of  the 
Mf  s by  cases  as  follows:  (i)  <5  — a + to  for  some  a < 8.  Then  as  for  19.18, 
{Ka+n  | n e to}  is  Prikry  PUs -generic  over  M$  and  = M$[{Ka+n  \ n e to}]. 

(ii)  For  some  a < 8,  { Ma , e,  Ua)  )=  cf(<5)  > to.  Then  M j1"  = M$.  For  instance, 
(Mw,  e,  Uw)  \=  kc0  is  regular,  so  M+  = MKio.  (iii)  Neither  (i)  nor  (ii)  obtain  for  <5. 
Then  M = MS[GS ],  where  G$  is  a specified  set  of  Prikry  generic  sequences  not 
well-orderable  in  . In  particular,  AC  fails  in  . Dehomoy  [83]  extended 
the  scheme  to  inner  models  of  stronger  hypotheses  and  forcing  for  changing  to  an 
uncountable  cofinality. 

The  basic  theory  of  this  section  is  extended  in  the  next  to  derive  the  main 
structure  theorems  for  inner  models  of  measurability. 
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One  of  the  emerging  themes  of  the  theory  of  large  cardinals  in  the  late  1960’s 
was  the  investigation  of  the  smallest  inner  models  in  which  they  maintain  their 
defining  property.  Like  L for  the  theory  ZFC  these  canonical  inner  models  were 
found  to  crystallize  the  necessary  consequences  of  the  corresponding  theories  as 
well  as  to  establish  the  relative  consistency  of  minimal  principles  like  GCH.  Sil- 
ver and  Kunen  initiated  this  direction  of  research  with  their  structure  theorems  on 
inner  models  of  measurability,  which  are  established  in  this  section.  As  attention 
shifted  to  strong  hypotheses,  the  development  of  inner  model  theory  was  to  be- 
come a full-fledged  program.  Guiding  structural  initiatives  were  thus  introduced 
beyond  the  combinatorial  theory  of  large  cardinals,  and  this  was  to  lead  to  remark- 
able equiconsistency  results  by  the  complementary  methods  of  forcing  and  inner 
models. 

Inner  model  theory  began,  of  course,  with  Gddel’s  fundamental  work  on  L. 
For  modest  large  cardinals  like  inaccessible,  Mahlo,  and  weakly  compact  cardinals, 
the  corresponding  inner  model  is  L itself  (3. 1,6.7).  Jensen’s  systematic  investi- 
gations beginning  in  the  late  1 960’s  revealed  a fine  structure  for  L which  is  thus 
compatible  with  these  cardinals,  and  led  moreover  to  informative  characterizations 
in  this  sharpened  context.  He  also  isolated  combinatorial  principles  obtaining  in 
L whose  negations  turned  out  to  entail  these  cardinals  in  L (see  volume  II).  Thus, 
the  careful  investigation  of  L resulted  in  a new  structural  understanding  of  modest 
large  cardinals. 

Before  these  developments,  the  relativity  introduced  by  Scott’s  result  that 
measurable  cardinals  contradict  V — L had  set  the  stage  for  the  development 
of  inner  models  for  measurability,  first  considered  by  Solovay.  Let  U be  a k- 
complete  ultrafilter  over  k > w.  Since  the  measurability  of  k amounts  to  the 
existential  postulation  of  such  a set  U,  Solovay  looked  at  L[U],  Levy’s  inner 
model  of  sets  constructible  relative  to  U (§3).  U — U fl  L[U]  e L[U]  and  so 
L[U]  = L[U ],  and  the  following  is  simple  to  check: 

20.1  Exercise. 


L\U]  |=  U is  a k -complete  ultrafilter  over  k , 


and  if  U is  normal,  then 


L[U ] 1=  U is  normal  . H 

Thus,  k is  measurable  in  L\U],  and  like  L with  respect  to  ZF,  it  is  consistent 
with  k being  measurable  that  V — L[U],  so  that  U could  have  been  U all  along. 
Focusing  on  these  inner  models  of  measurability , 

( L[U ],  e,  U)  is  a K-model  iff  {L[U],  e,  U)  |=  U is  a normal  ultrafilter  over  k. 
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Thus,  U e L\U]  is  incorporated  from  the  beginning  as  a unary  relation.  That 
U e L[U]  implies  that  L[U]1^  = L[U ] by  3.2(b),  and  hence  that  L[U\  |=  ''V  — 
L[U]^ . It  is  not  assumed  that  k is  a cardinal  in  V.  (This  section  continues  the 
extensive  use  of  k,  X, ...  to  denote  cardinals  only  in  the  sense  of  some  transitive 
e-model.)  Initial  observations  were  promising: 

20.2  Lemma  (Solovay).  Suppose  that  (L[U],  e,  U)  is  a K-model.  Then  the  fol- 
lowing hold  in  L[U]: 

(a)  WX  >k( 2x  = X+). 

(b)  k is  the  only  measurable  cardinal. 

Proof,  (a)  Godel’s  argument  for  GCH  in  L works  in  any  L\A\  for  sufficiently 
large  cardinals;  in  the  present  case,  that  U c V(k)  allows  it  to  work  for  any 
X > k:  For  such  X we  shall  verify  that  V(X)  fl  L[U]  c Lk+[U],  drawing  the 
conclusion  from  |Lx+[£/]|  = X+ . Since  L[U]  \=  V — L[U~\,  the  argument  carried 
out  inside  L[U\  then  gives  the  desired  result. 

Suppose  then  that  x e V(X)  fl  L[U].  Let  y > X be  a limit  ordinal  such  that 
x,U  e Ly[U],  and  (H,  e,U  n H)  < ( Ly[U ],  e,  U)  such  that  X U [x,  U}  c H 
and  \H\  — X.  Let  (N,  e,W)  be  the  transitive  collapse  of  (//,  e,U  fl  H)  and  jt 
the  collapsing  isomorphism.  Since  X c H,  n(y)  = y for  every  y e V{X)  n H. 
Consequently,  7r(x)  = x and  W — jt“(U  fl  H)  = U fl  N.  By  elementarity 
(N,  e,  U fl  N)  satisfies  the  sentence  oq  of  3.3(b),  so  that  N — L$[U]  for  some  8. 
Finally,  |iV|  = X so  that  8 < A+,  and  the  result  follows  since  x e Lg[U]  c Lk+[U]. 

(b)  Assuming  throughout  that  V — L[U ],  suppose  to  the  contrary  that  there 
is  a measurable  cardinal  X f k.  Let  W be  a A-complete  ultrafilter  over  X and 

j:  V -<  M = Ult(V,  W)  . 

Since  V is  the  class  L[£/]  definable  from  U,  by  elementarity  and  3.2(b),  M — 
L[.j(U)]M  = L[j(U)].  It  will  in  fact  be  established  that  M — L[U ],  and  this 
suffices  for  a contradiction  since  by  5.7(e),  W <£  M. 

If  A > a:,  then  j (U)  — U,  and  so  we  are  done.  Suppose  then  that  X < k.  As 
U is  normal, 


Zs  = {o:<A:|a>A  A a is  inaccessible}  e U , 

and  by  5.7(c),  j(a ) = a for  every  a e E.  It  follows  from  this  that  j{U)  = 
U D M:  First,  suppose  that  X e j(U),  say  X = [f\w  with  / e XU . Then 
Y — H|<x  /(£)  € U by  /r-completeness,  and  clearly  j(Y ) c X.  But  j(Y ) 2 
j“(Y  n E)  = Y Pi  E e U since  j is  the  identity  on  E,  and  so  X e U n M.  With 
j (U ) C U DM  verified,  it  follows  that  j (U)  = U DM  since  j (U)  is  an  ultrafilter 
in  M.  3.2(b)  now  implies  that 

M — L[j ({/)]  = L[U  CM]  — L[U]  . H 

Extending  20.2(a),  Silver  obtained  the  first  substantial  result  about  k -models, 
that  full  GCH  holds: 
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20.3  Theorem  (Silver  [66a,  71a]).  Suppose  that  ( L[U\ , e,  U)  is  a K-model.  Then 
L[U ] \=  GCH. 

Proof.  Keeping  in  mind  that  L[U\  |=  V = L[U],  we  argue  in  L[U]  to  verify 
GCH  below  k there:  Suppose  that  X < k.  Recall  the  well-ordering  <l[u]  of  L[U] 
given  by  the  formula  (p\(vo,  Ui)  of  3.3(b).  Although  there  may  be  new  subsets  of 
X appearing  late  in  the  L[U]  = [Jff  La[U]  hierarchy,  we  shall  establish  a property 
that  clearly  suffices,  that  <l[u]  IC P(X)  x V{X))  has  ordertype  < k+.  In  other 
words,  for  any  y e V(X)  D L[U], 

\{x  e V(X)  n L[U]  | x <£[!/]  y}|  < X . 

Assume  to  the  contrary  that  this  fails.  Then  there  is  a y e V(X)  n L\U\  such 
that  with 

R = {x  e V{X)  n L[U\  | x <L[u ] >’}  , 

\R\=X+. 

Let  y be  a limit  ordinal  > k with  y,  U e Ly[U],  and  consider  the  structure 

A={Ly[U],e,U,R,{y})  ■ 

Since  U is  A+-Rowbottom  (as  defined  before  16.10),  by  the  argument  for  8.4(b) 
using  a language  with  constants  for  each  ordinal  less  than  X,  there  is  a 

B=  {B,e,Ur\B,Rr\B,  {y}>  -<  A 

such  that  \R  fi  B\  < X,  X c B,  and  B fi  k e U . Let  (N,  e,  W)  be  the  transitive 
collapse  of  ( B , e,  U (T  B ),  and  n the  collapsing  isomorphism.  Since  X c B, 
i t(x)  — x for  any  x e V(X)  D B.  In  particular,  y e N and  R fi  N = R fi  B. 
We  shall  verify  that  W — U fl  N.  By  elementarity  and  3.3(b)  this  would  imply 
that  N = Lp[(7]  for  some  limit  p,  and  R c N since  y e N,  leading  to  the 
contradiction  \N\  = |R  fl  N\  = |R  D B\  < X. 

To  verify  that  W = U fl  N,  note  that:  B D at  e U,  rt(f)  < % for  any  § e B, 
and  7 r is  injective.  It  follows  from  the  normality  of  U that 

E = {£  e B (Xk  | t t($)  =$}€[/. 

For  any  S e N,  say  S = n(S)  with  5 e fl,  it  follows  that:  S e W iff -ID  e 
U(D  fl  E C S ) (taking  D — S in  one  direction  since  W — : x“(U  fl  B ))  iff 
3D  e U(D  fl  E c 5))(since  i r is  the  identity  on  D (T  E)  iff  S e U . Hence, 
W = U fl  N and  the  proof  is  complete.  H 

Instead  of  going  into  inner  models  Jensen  [74]  established  the  relative  consis- 
tency of  GCH  with  a measurable  cardinal  by  forcing.  Relying  on  a combinatorial 
contingency,  he  showed  that  a measurable  cardinal  in  the  ground  model  remains 
measurable  upon  forcing  with  an  Easton  product  of  collapsing  p.o.’s.  This  method 
first  established  the  relative  consistency  of  GCH  with  Ramsey  cardinals,  for  which 
an  inner  model  theory  became  available  only  later. 
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Digressing  somewhat,  the  following  sketches  Solovay’s  original  proof  of 
16.1(b),  that  real-valued  measurable  cardinals  are  measurable  in  an  inner  model. 

20.4  Exercise  (Solovay  [71]).  Suppose  that  I is  a normal  X-saturated  ideal  over 
k where  X < k.  Then 

L[I]  |=  k is  measurable  . 

Hint.  Check  that  if  I = I fl  L[I],  then  L[I\  \=  I is  a normal  ^-saturated  ideal 
over  k.  Now  work  in  L[I]:  X+  < k by  16.3,  and  I is  A+-Rowbottom  by  16.11, 
so  check  that  the  argument  for  20.3  works  to  show  that  2X  — A.+  . Now  apply 
16.4(a).  H 

The  foregoing  results  about  L[U]  depended  on  U being  preserved  as  a pred- 
icate in  transitive  substructures,  i.e.  if  N is  the  domain,  what  corresponds  to  U 
is  just  U C\  N.  The  further  results  required  a more  sophisticated  analysis  of  rela- 
tive constructibility.  Although  just  the  Rowbottom  property  of  normal  ultrafilters 
suffices  for  the  GCH  result  20.3,  Silver  saw  that  the  Ramsey  property  leading 
to  substructures  generated  by  indiscemibles  provides  a A\  well-ordering  of  the 
reals  in  L[JJ],  sharpening  the  analogy  with  L.  Kunen  established  that  a k -model 
{L[U],  e,  U)  only  depends  on  at,  and  in  fact,  if  W is  any  k -complete  ultrafilter 
over  k then  L[U\  = L[W].  Moreover,  he  showed  that  all  the  /c -models  for  differ- 
ent /c’s  are  generated  by  taking  iterated  ultrapowers!  In  particular,  all  these  inner 
models  of  measurability  have  the  same  theory. 

With  the  preparations  of  §19,  these  and  further  structure  theorems  will  be 
established  in  a terminology  and  approach  that  anticipates  later  developments. 
In  particular,  iterated  ultrapower  proofs  will  be  provided  for  Silver’s  results  on 
GCH  and  the  well-ordering  of  the  reals.  The  clear  internal  structure  and 
striking  global  coherence  of  inner  models  of  measurability  revealed  by  these  results 
provide  a forceful  argument  for  the  consistency  of  the  theory:  ZFC  plus  there  is 
a measurable  cardinal. 

We  begin  by  considering  structures  that  approximate  at -models  and  sharpen 
the  §19  context: 

(M,  e,  U)  is  a ZFC”  premouse  {at  k)  iff 

U is  an  M-ultrafilter  over  k and 
M — Lt;[U ] for  some  £ (allowing  M — L[U])  ■ 

The  terminology  is  a capitulation  to  historical  development,  and  anticipates  a 
rodent  infestation  in  a finer  context  (the  Core  Model  - see  volume  II).  The  ZFC” 
refers  to  the  implicit  assumption  in  the  term  M-ultrafilter  that  M |=  ZFC”,  but 
for  the  sake  of  brevity 

by  premouse  is  meant  ZFC”  premouse  in  this  section 

with  the  caveat  that  the  term  will  have  a different  meaning  in  volume  II.  Some 
simple  observations: 
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(i)  The  weak  amenability  condition  for  M-ultrafilters  U is  redundant 
for  M = L([U]. 

(ii)  Any  iterate  of  a premouse  is  also  a premouse  by  3.3(b)  and 
and  elementarily. 

(iii)  A k -model  (L[U],  e,  U)  is  a premouse  such  that  U e L[U]. 

To  allow  this  inner  model  case  implicitly,  we  stipulate  that  in  this  context  that 
Lt;[U]  includes  the  possibility  “f  = On”,  i.e.  L^[U]  — L[U ] . 

Iterability  in  such  cases  is  an  immediate  consequence  of  19.16: 

20.5  Lemma.  Suppose  that  (Lf[t/],e,  U)  is  a premouse,  L%[U]  1=  ZFC,  and 

to i U {[/}  c Lf[£/].  Then  (Lj[t/],  e,  U)  is  iterable.  H 

In  particular,  any  k -model  is  iterable  (and  its  iterates  are  definable  in  it). 

The  following  is  a crucial  observation  about  the  definability  of  certain  iterates 
of  a premouse.  Recall  that  if  v > co  is  regular,  then  Cv  denotes  the  closed 
unbounded  filter  over  v. 

20.6  Lemma.  Suppose  that  (M,  e,  U)  is  a premouse  at  k with  iteration 
(Ma,  Ua,  Ka , iaft)a<ft€T>  titid  v e t is  a regular  cardinal  > |kA  (T  M |.  Then  for 
some  f 

My  = Lc[Cy]  and  Uv  = Cv  n L( [Cv]  . 

Proof.  kv  = v by  19.7(b),  and  Uv  c CVC\MV  by  19.5.  Hence,  Uv  = CVC\MV  since 
Uv  is  an  ultrafilter  on  V(v)  D Mv.  Since  (Mv,  e,  Cv  fi  Mv)  satisfies  the  sentence 
a\  of  3.3(b),  the  result  follows.  H 

20.7  Corollary  (Kunen  [68,70]).  Suppose  that  there  is  a K-model,  and  v is  a 
regular  cardinal  greater  than  k+ . Set  Cv  — CVC\  L[CU],  Then  L[CV\  is  a v-model, 
i.e. 

{L[CV\,  e,Cv)  1=  Cv  is  a normal  ultrafilter  over  v . 

Proof.  Let  (L[(7],  e,  U)  be  a k -model.  By  20.5,  (L[U],  e,  U)  is  iterable  and  by 
20.2(a),  k+  > I**'  fl  L[U] |.  Hence,  the  result  follows  from  20.6,  since  L[CV,]  = 
L\CV\  is  the  vth  iterate  of  L[U].  H 

Thus,  constructing  relative  to  filters  definable  in  ZFC  can  result  in  internal 
measurability!  This  compelling  insight  of  Kunen’s  became  a recurring  theme  in 
inner  model  theory,  bringing  Ulam’s  concept  of  measurability  further  into  the  fold 
of  set  theory. 

The  next  result  is  the  main  technical  lemma  from  which  the  structure  theorems 

flow. 

(M,e,U)  and  (N,e,W)  are  comparable  iff 

either  U = W C\  M or  W = UPN  . 
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In  other  words,  there  is  an  F such  that  U — F fl  M and  W — F fi  N with 
M = L^[F]  and  N — LV[F\  for  some  t,  and  rj,  so  that  M and  N are  initial 
segments  of  the  same  relative  constructible  hierarchy.  Of  course,  if  M and  N 
are  inner  models,  then  M = N.  The  following  direct  consequence  of  20.6  is  the 
Comparison  Lemma  for  ZFC-  Premice. 

20.8  Lemma.  Suppose  that  (M,  e,  U)  and  (N,  e,  W)  are  iterable  premice.  Then 
there  is  an  iterate  of  ( M , e,  U)  and  an  iterate  of  ( N , e,  W ) that  are  comparable. 

H 


We  need  one  further  preliminary  result  on  definability.  The  3.3(b)  formula 
<Pi(t>o,  Ui)  of  £e(A)  defines  in  any  (L[A],  e,  A 0 L[A]}  a well-ordering  <l[a]  of 
L[A]  such  that:  for  any  limit  8 > co  and  x,y  e La  [A], 

* <L[A\  y iff  (LS[A],  e,  A D La[A]>  \=  <pi[x,  y]  . 

Following  a discussion  after  9.3,  for  any  formula  (p{v o,...,vm)  of  Ce(A)  the 
canonical  Skolem  term  tv  for  q>  can  be  defined  that  chooses  the  least  witness  to 
Do  according  to  <p\  whenever  possible.  Consequently,  for  any  limit  ordinal  8 > co 
and  X c La[A],  the  Skolem  hull  of  X in  (La[A],  e,  A fi  La[A])  can  be  taken  to 
be  well-defined,  with  domain 

{^ii[A],e,AnLj[A])(x^  ^ x„)  | <p  is  a formula  of  £e(A)  A xi, . . . , xn  e X}  . 

This  coincides  with  the  collection  of  those  x e La  [A]  such  that  {.r}  is  definable 
over  (La[A],  e,  A fi  La[A])  using  parameters  from  X. 

By  Skolem  term  is  meant  one  of  these  tf  s in  this  section  . 

20.9  Lemma.  Suppose  that  ( L[U ],  e,  U)  is  a K-model,  B c On  with  B\  > k+ , 
and  8 is  a limit  ordinal  such  that  B C Ls[U].  Then  V(jc)  fl  L[U]  is  a subset  of  the 
Skolem  hull  of  k U B in  (La[t7],  e,  U),  so  that  for  any  X e V(k)  fl  L[U], 

X = tlL‘M  ’e’u)^u...,U,rn,---,rin) 


for  some  Skolem  term  t,  £i , • . . , e k,  and  rji , . . . , rjn  e B. 

Proof.  Let  ( H . e,U  fl  H)  be  the  stated  Skolem  hull,  ( N , e,  W)  its  transitive 
collapse,  and  tz  the  collapsing  isomorphism.  Then  it  (a)  — a for  every  a < k,  so 
jt( X)  = X for  every  X e V(k)  fl  H.  Consequently,  W — U fl  N,  so  that  using 
the  sentence  o\  of  3.3(b),  N — Lf  [U\  for  some  £.  t;  > k+  since  |Z?|  > k+ , so  by 
the  proof  of  20.2(a),  V(k)  fl  L[U]  c N.  Hence,  V(k)  fl  L[U]  c H as  tt  is  the 
identity  on  subsets  of  k.  H 


The  Main  Results 

We  are  now  in  a position  to  prove  the  main  structure  theorems  on  k -models. 
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20.10  Theorem  (Kunen  [68,70]).  Suppose  that  (L[t/],  e,  U)  and  {L[W\,  e,  W) 
are  both  K-models.  Then  U — W and  so  L\U]  = L[VF], 

Proof.  By  20.5  {L[U],  e,  U)  and  (L[1F],  e,  W)  are  iterable,  so  by  20.8  they  have 
comparable  iterates  and  hence  a common  iterate  (L[F],  F)  since  L[U]  and 
L\W\  are  inner  models.  Let 

i:{L[U\,e,U)  ^ {L[F],e,  F)  and 
j:(L[W],e,W)  < (L[F],  e,  F) 

be  the  corresponding  embeddings.  By  19.7(c), 

{0  >k  \e  = \6\=  i(6)  = j(9)} 

is  a proper  class,  so  let  B U {5}  consist  of  members  of  this  class  so  that  \B\  = k+ 
and  8 > sup(B). 

For  any  lei/,  by  20.9 

for  some  Skolem  term  f,  f i, ...  ,%m  e k,  and  rji, ...  .rj„  e B.  Set 

Y = t{Lslw]’e'w) . 

Since  Skolem  terms  are  definable, 

i(X)  = t^in.e.F)^u  . . . , ^ . . . , r,n)  = j(Y ) 

noting  that  all  the  parameters  are  fixed  by  i and  j . X e U,  so  i(X)  = j(Y ) e F, 
and  hence  Y e W.  But  also 

X = i(X)  (T*r  = i(Y)  n k = Y . 

This  argument  establishes  that  U C.  W,  and  the  converse  follows  mutatis  mutandis. 

H 


20.11  Corollary.  If  (L\U],  e,  U)  is  the  K-model,  then  U is  the  only  normal 
ultrafilter  over  k in  L[U]. 

Proof.  Suppose  that  W is  a normal  ultrafilter  over  k in  L\U\.  Setting  W — 
W (T  L[W~\,  L[W~\  is  a k -model.  Hence  W — U,  and  since  W is  an  ultrafilter  in 
L[U],  W = U.  H 

Thus,  if  there  is  a a: -model  ( L [ U ] . e.  U),  then  it  is  unique,  k is  the  only 
measurable  cardinal  in  L[U]  (20.2(b)),  and  U is  the  only  normal  ultrafilter  over 
k in  L[U].  This  last  contrasts  with  the  consistency  of  having  the  maximum 
possible  number  7r  of  normal  ultrafilters  (17.8).  (The  possible  number  of  normal 
ultrafilters  is  further  discussed  before  22.13.) 


268 


Chapter  4.  Aspects  of  Measurability 


The  next  theorem  shows  that  all  the  k -models  for  various  k are  produced 
by  iterating  the  F-model  where  k is  least  possible;  consequently,  all  a: -models  for 
various  k are  definable  subclasses  of  this  F-model  and  are  elementarily  equivalent 
to  each  other. 

20.12  Theorem  (Kunen  [70]).  Suppose  that  {L[U],  e,  JJ)  is  the  K-model  and 
{L\W\,  e,  W)  is  the  X-model  with  k < X.  Then  {L\W\,  €,  W)  is  an  iterate  of 
(L[U],  e,  U).  In  particular,  L[W~\  C L[U]  and  ( L[W ],  e,  W)  is  definable  in 
{L[U],  e,  U)  from  X. 

Proof.  Let  (L[Ua\,  Ua,  Ka,  iap) a<peOn  be  the  iteration  of  {L[U],  e , U),  so  that  in 
particular  each  (L[Ua],  e,  Ua)  is  definable  in  {L[U\,  e,  U).  Since  the  Kf  s form  a 
closed  unbounded  class  of  ordinals,  there  is  a unique  /I  such  that  Kp  < X < Kp+\. 
If  Kp  = X,  we  are  done  by  20.10.  So,  assume  toward  a contradiction  that  Kp  < 

^ < Kp+ 1: 

Let  (L[F],  e,  F)  be  a common  iterate  of  {L[U],  e,  U)  and  (L[W],  e,  W), 

with 

/:  {L[U],  e,  U)  < {L[F],  e,  F)  and 
j:  (L[W],e,W)  < [L[F],  e,  F) 

the  corresponding  embeddings.  Hence,  i = ioy  for  some  y > f).  Arguing  as  for 
20.10,  let  B U {5}  consist  of  cardinals  greater  than  X fixed  by  both  ipy  and  j,  with 
\B\  = Kp  and  8 > sup(fi). 

By  5.13(a)  there  is  an  f e Kf<Kp  fl  L[Up]  such  that  X = ip,p+\{f){Kp).  By 

20.9, 

f _ fLslUple.Up)  . . . , |m,  . . . , tjn) 

for  some  Skolem  term  t,  £i, . . . , e Kp,  and  rji,  ...,r)n  e B.  Hence, 

X — ip,p+\{f){Kp)  = M<Ld^+i],e,U/i+i>  ; £m,  Kp,  ?;„) 

for  some  Skolem  term  u.  By  definability  of  Skolem  terms, 

A = u{Lslne'F)(%  1,  . . . , Kp,  111 tin) 

since  X < Kp+\  and  so  every  parameter  as  well  as  X is  fixed  by  ip+i,Y.  But  since 
Kp  < X,  the  expression  on  the  right  also  equals 

j(u{L‘lw]'€’w)(% i,  Kp,  r)\,  ..■ , tin)) 

for  similar  reasons.  Thus  X e ran  (j),  contradicting  X = crit(y')-  H 

These  results  have  emphasized  normal  ultrafilters.  Kunen  also  showed  that 
constructing  relative  to  an  arbitrary  v-complctc  ultrafilter  over  k again  yields  the 
a: -model,  and  analyzed  the  structure  of  the  k -complete  ultrafilters  over  k in  the 
a: -model.  The  following  technical  result  is  the  basis: 
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20.13  Exercise  (Kunen  [70]).  Suppose  that  W is  any  K-complete  ultrafilter  over 
k > co,  and  j:  V < Mw  = Ult(V,  W).  Let  (L[U\,  e,  U)  be  the  K-model  with 
iteration  {L\Ua],  Ua , Ka , iap)a<pe  On-  Then  for  some  f,  j and  i $p  agree  on  V(k)  fl 
L[U]. 

Hint.  L[j(U)]Mw  = L[j(U )]  is  the  j (/c)-model,  so  by  20.12  there  is  a ft  such 
that  Kp  — j(K ) and  Up  — j (U).  Show  that  this  ft  is  as  desired  with  an  appropriate 
use  of  20.9.  H 


20.14  Corollary.  Under  the  above  hypotheses,  L\W~\  — L[U ]. 

Proof.  Since  k is  measurable  in  L[W],  there  is  a k -model  which  is  a subclass  of 
L[W],  Hence,  by  20.10,  L[U]  c L[W~\.  For  the  converse,  it  suffices  to  show  that 
W fl  L[U]  e L[U],  for  then  L[U]  3 L[W  fl  L[U]\  — L[W\.  For  this  purpose,  let 
p — [id]  w,  where  id:  k ->  k is  the  identity,  so  that  for  any  X C k,  X e W iff 
p e j(X).  Then  using  20.13, 

w n L[U\  = {Xe  V(K)  n L[U]  | P e j(X)) 

= {Xe  V(k)  n L[U\  | p e i0p(X)}  e L[U] 

since  iop  is  definable  in  { L[U\ , e,U).  ~\ 


Proceeding  toward  a description  of  the  /r-complete  ultrafilters  over  k in  the 
a: -model,  the  rudiments  of  a structure  theory  for  ultrafilters  are  discussed  next,  a 
subject  which  is  pursued  further  in  volume  II: 

Suppose  that  S and  T are  sets,  /:  S — > T , and  D is  an  ultrafilter  over  S. 


Then 


/*(£>)  = {KT  | e D}  . 


If  principal  ultrafilters  were  allowed,  it  can  be  checked  that  /*(£))  is  always  an 
ultrafilter  over  T , the  natural  projection  of  D via  /.  Note  that  if  X <=  D,  then 
/“X  e /*(£>).  If  moreover  E is  an  ultrafilter  over  T, 


D and  E are  equivalent  iff  there  is  a bijection  f:S->T  such  that  E = ffiD). 


Thus,  equivalent  ultrafilters  are  the  same  up  to  a relabeling  of  their  underlying 
sets.  The  following  is  well-known. 


20.15  Proposition.  Suppose  that  D is  an  ultrafilter  over  S and  /:  S ->  7 . 

(a)  If  g:  S -»■  T and  [i  e S | f(i)  — g(i)}  e D.  then  g*(D)  — /*(£>). 

(b)  If  h:  S — > S and  hfiD)  — D.  then  [i  € S \ h(i ) = ;}  € D. 

(c)  If  E is  an  ultrafilter  over  T,  |r|  = |S|,  ffiD)  = E,  and  g:  T — > S is 
such  that  gfiE)  — D,  then  D and  E are  equivalent. 

Proof,  (a)  This  is  clear. 

(b)  Assume  to  the  contrary  that  {;'  e S \ h(i ) > /}  e D;  the  < case  is 
analogous.  By  (a),  it  can  be  assumed  that  h{i)  > i for  every  i e S.  Let  h°(i)  = 
i,  hn+\i)  = h{hn  (/))  for  n e co,  and  define  a relation  ~ on  5 by: 
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«'~7  iff  3mln(hm(i)  = hnU))  . 

It  can  be  checked  that  ~ is  an  equivalence  relation,  so  fix  a representative  for  each 
equivalence  class.  Let  So  C S consist  of  those  i such  that  if  r is  the  representative 
of  the  equivalence  class  of  i and  hm(i)  = hn(r),  then  | m — n\  is  an  even  number. 
(Note  that  this  is  independent  of  m and  n as  h is  increasing  on  .S'.)  Set  ,S|  = S — Sq. 
Then  for  some  e <2,  Se  e D,  and  so  h“Se  e h*{D)  = D.  But  it  is  simple  to  see 
that  h“Se  c S\-e,  leading  to  the  contradictory  Se  fl  h“Se  = 0 e D. 

(c)  (g  o /)*(£))  = D,  so  that  {;'  £ S g(f(i))  = i } e_D  by  (b),  and  / is 
injective  on  this  set.  Since  |S|  = \T\,  there  is  a bijection  f:S-+T  such  that 
{i  € S | f(i)  — /(/)}  e D,  and  so  the  proof  is  complete  by  (a).  H 

In  L\U],  where  (L[U],  e,  U)  is  the  K-model,  U is  the  only  normal  ultrafilter 
over  k,  and  from  U the  product  ultrafilters  U"  over  [a:]"  for  new  can  be  defined 
(as  after  19.7).  (b)  below  asserts  that  every  /r-complete  ultrafilter  over  k in  L[U] 
is  essentially  one  of  these. 

20.16  Theorem  (Kunen  [70];  Paris  [69]  also  for  (a)).  Suppose  that  V = L[U] 
where  (L[U\,  e,  U)  is  the  K-model  with  iteration  ( L\Ua ],  Ua,  Ka,  iap)a<pe On-  Let 
W be  any  k -complete  ultrafilter  over  k.  Then: 

(a)  W — {X  C k | p e iQio(X)}  for  some  p < k10. 

(b)  W is  equivalent  to  Un  for  some  n with  0 < n e co. 

Proof.  Letting  j:  V -<  Mw  = Ult(V,  IT),  first  observe  that  V = L[U~\  implies  that 
the  f of  20.13  must  be  finite:  By  20.14,  V — L[W\  and  Mw  — L[/'(W)]  = L[Up] 
by  the  uniqueness  of  j (Ar)-models.  If  to  the  contrary  f > co,  then  Km  is  inaccessible 
in  L[Up]  since  it  is  measurable  in  L[UC0\  and  7 '’(*:«)  H L[Uco\  — V(k,o)  fl  L[Up]. 
However,  since  Mw  contains  every  countable  sequence  of  ordinals  by  5.7(d),  Km 
has  cofinality  co  there,  which  is  a contradiction.  Hence,  f = n for  some  n with 
0 < n e co. 

(a)  now  follows  from  the  proof  of  20.14,  since  /,!M(p)  = p for  any  p < 
j(K)  — i0n(K). 

For  (b),  if 

en:V  < Mun  ^ ^"v/Un  , 

then  = L[Un ] and  en  — io„  by  the  analysis  of  19.9(b).  Let  id:  k — >■  k and 
id:  [k]"  — >■  [a:]"  be  the  identity  maps  on  their  respective  domains,  and  /:  a:  — > [a:]” 
and  g : [a:]"  — > k such  that  [/] w = [id][/«  and  [g]un  = [id] • Because  j and  en 
agree  on  V(k)  and  hence  on  V([k]"),  for  any  X C [a:]", 

X e Un  iff  [Id e en(X) 
iff  [f]wej(X) 
iff  f~\X)eW. 


Hence,  f*(W)  = Un . Similarly,  g*(Un)  — W,  and  so  the  proof  is  complete  by 
20.15(c).  H 
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The  next  goal  is  to  establish  Silver’s  result  that  there  is  a Al  well-ordering 
of  the  reals  in  L[U ].  The  argument  is  in  terms  of  premice,  and  to  motivate  it, 
another  proof  of  Silver’s  GCH  result  20.3  is  first  given,  one  that  has  the  advantage 
of  having  generalizations  to  inner  models  of  stronger  hypotheses. 

Second  Proof  of  20.3.  As  before,  it  suffices  to  show  for  X < k that  for  any 
yeV(X)nL[Ul 


\{x  e V(X)  n L[U]  I X <L[U ] v}|  < 1 . 

Suppose  then  that  y e V{X)  fl  L[U],  and  let  x e V(X)  n L[U]  with  x <l[u]  y, 
i.e.  [L[U],  e,  U)  \=  <p\\x,  y]  for  (P\{vq,  tq)  of  3.3(b).  Then: 

(*)  If  ( A , e,W)  is  an  iterate  premouse  at 

some  v > X and  y e A , then  x e N. 

To  show  this,  by  20.8  let  (L[F],  e,  F)  and  {L^[F],  e,  F D Lf[A])  be  compara- 
ble iterates  of  (L[t/],  e,  U)  and  (A,  e,  W)  respectively.  Then  ye  Vv  fl  A — 
Vv  fl  L^[F]  by  19.4(b).  Remembering  that  X < k,  (L[F],  e,  F)  \=  (pi[x,  y] 
by  elementarity,  and  so  by  3.3(b),  y e L^[F]  implies  that  x e L^F],  Hence, 
x e Vv  fl  LfF]  — Vvn  N.  (Note  for  the  next  theorem  that  (A,  e,  W)  ^ (p\[x,  y], 
again  by  elementarity.) 

With  (*),  the  proof  devolves  to  finding  an  iterable  premouse  (A,  e,  W)  at 
some  v > X with  |A|  < X.  But  this  is  simple:  Let  0 be  a sufficiently  large 
regular  cardinal  such  that  y,U  e Ln\U],  Then  by  the  absoluteness  19.16  applied 
with  S = L[U\ , {Lg[U],  e,U)  is  an  iterable  premouse.  Take  the  Skolem  hull  of 
kU{y}  in  ( Lg[U ],  e,  (7);  then  its  transitive  collapse  is  a premouse  as  desired,  with 
iterability  a consequence  of  19.15(e).  H 

The  next  results  are  about  the  set  of  reals  in  the  k -model.  An  early  intimation 
of  the  Core  Model  (see  volume  II)  is  that  this  set  is  independent  even  of  at;  this 
follows  by  a simpler  version  of  the  previous  argument  depending  only  on  20.12: 

20.17  Exercise.  Suppose  that  (L[I7],  e,  U)  is  the  K-model  and  (L[W],  e,  W)  is 
the  X-model  for  some  k and  X.  Then 

“co  fl  L[U~\  = mu>  fl  L[W]  and  2Ccl>)  fl  <L[£/]=  2Ca>)  fl  <l[w]  ■ H 


In  referring  to  these  sets  in  what  follows,  L[U\  is  used  generically  since  the 
sets  do  not  depend  on  any  particular  choice.  Recall  from  before  13.8  that  any 
z ewa>  encodes  a binary  relation  Ez  = {(m,  n)  \ z({m,  «))  = 0}  and  consequently 
a structure  Mz  = (co,  Ez)  for  the  language  Ce  of  set  theory.  Defining  Az  — 
{n  | z(n)  = 1},  z also  encodes  a structure 


Nz  = (w,  Ez,  Az) 
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for  the  augmented  language  CAA).  As  before,  if  Nz  is  well-founded  and  ex- 
tensional,  then  it  has  a transitive  collapse  tr (Nz)  and  an  isomorphism  nz:  Nz  -> 
tr  (Nz). 

The  following  is  the  analogue  of  Godel’s  result  13.9  for  L: 

20.18  Theorem  (Silver  [71b]).  wco(lL[U]  and2{m(A)  n <L[U]  are 
Proof.  Proceeding  directly  to  the  well-ordering,  for  any  x,  y e w(o  IT  L[U], 

(**)  x < l[u]  y iff  there  is  a countable,  iterable  premouse  (N,  e,  W) 
with  x,y  e N and  ( N , e,  W)  |=  <pi[x,  y]  . 

This  follows  from  the  argument  for  (*)  in  the  proof  of  20.16  together  with  the 
Skolem  hull  argument  at  its  end.  It  remains  to  verify  that  this  gives  a descrip- 
tion: 

Building  on  3.3(b)  and  13.8(a)(c), 

Z = [z  e “&>  | Nz  is  well-founded  and  extensional  A tr(Ak)  is  a premouse} 

is  77/.  It  now  suffices  to  show  that  (z  e Z A tr (Nz)  is  iterable)  is  77/,  for  then 
(**)  can  be  rendered  as  3 z(77/ ). 

By  19.15(b)  a premouse  is  iterable  iff  its  iteration  has  length  > co\.  Recalling 
the  analysis  14.11  of  0#,  this  can  be  asserted  through  codes  for  countable  well- 
orderings.  First,  with  -<_  as  before  19.3  let  R c 3(“tt>)  be  given  by  R(a,b*z) 

iff 

(i)  Ea  — {( m,n ) \ a((m,  n))  — 0}  is  a well-ordering  with  field  a>. 

(ii)  b codes  a set  {zb„  \ n e co}  c Z together  with  codes  for  embeddings 

tr (Nzb)  tr)^*)  for  (. m,n ) e Ea  such  that: 

(iii)  if  n is  the  minimum  in  terms  of  Ea,  then  zbn  — z; 

(iv)  if  n is  the  immediate  successor  of  m in  terms  of  Ea,  then  tr (N,b) 
is  the  ultrapower  of  tr(NzA  and  imn  the  corresponding  embedding; 
and 

(v)  if  n is  a limit  point  of  Ea,  then  tr(AA*)  is  the  direct  limit  of  the 
corresponding  structures  and  embeddings,  and  imn  for  ( m , n)  e Ea 
the  corresponding  embedding  into  the  direct  limit. 

It  is  tedious  but  straightforward  to  check  that  R is  Fl\  since  Z is  77/  and  satisfaction 
for  NZn  is  A\  (cf.  13.8(b)(c)). 

Next,  let  S c 2(loa))  be  given  by  S(a,  b ) iff 

(i),  (ii),  (iv),  and  (v)  above. 

(vi)  if  Ea  has  a maximum  element  n,  then  the  ultrapower  of  tr(/V,n)  is 
well-founded. 

(vii)  if  Ea  has  no  maximum  element,  then  the  direct  limit  of  the  struc- 
tures and  embeddings  coded  by  b is  well-founded. 

S is  also  77/. 
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Finally, 

(z  e Z A tr (TV,)  is  iterable)  iff  z e Z A v'oV1  b(R(a,  b,  z)  —*■  S{a,  b))  , 
which  is  of  form  Fl\  —*■  77j),  and  hence  77^ . H 

20.19  Corollary.  If  01  oj  C L[£7],  t/?en  2("w)  fl  <l[£/]  w a A3  set  which  is  not 
Lebesgue  measurable  and  does  not  have  the  Baire  property. 

Proof.  As  for  13.10.  H 

As  for  L (cf.  13.11)  there  is  a refinement  of  20.19,  establishing  in  the  parlance 
that  in  L[U],  2(“w)  fl  <l[u]  is  a .£3 -good  well-ordering: 

20.20  Exercise  (Silver  [71b:  440]).  The  relation  ISl[j/]  (=  2(“&>)  given  by 

ISL[(/](.r,  y)  {(*),■  \i  eco}  = {z  e “co]  z <L[u ] y) 
is  E\.  H 

For  the  perfect  set  property,  the  argument  for  L (13.12)  can  be  adapted.  For 
this  purpose,  first  observe  that  there  is  a natural  ordering  of  iterable  premice: 

(M,  e,  U)  <iP  ( N , e,  W ) iff  there  is  an  F such  that  (Lf  [F],  e,  F fl  [F])  is 

an  iterate  of  (M,  e,  U),  (L,,[F],  e,Fn  L,,[F]}  is 
an  iterate  of  ( N , e,  W),  and  t;  < q . 

It  is  straightforward  to  check  using  the  Comparison  Lemma  20.8  that  <;p  is  a well- 
defined  well-ordering  of  iterable  set  premice.  Next  observe  that  countable  such 
(M,  e,  U)  and  ( N , e,  W)  have  countable  iterates  that  are  comparable:  Suppose 
that 

( Ma , Ua,Ka , iafi)a<! 8eOn  is  the  iteration  of  (M,  e,  U)  , and 
(Na,  Wa.  ka,  jaft) a<fieOn  is  the  iteration  of  (A,  e,  W)  . 

Noting  that  /c^  = = Xm  by  19.7(c)  so  that  {at„  | a < a>\}  and  {!„  | a < wi] 

are  both  closed  unbounded  in  &>i,  let  (yn  new)  be  an  increasing  sequence 

of  ordinals  drawn  from  their  intersection  and  set  y — sup([y„  | n e co}).  Then 
y — Ka  — Xp  for  some  a.  ji  < co i,  and  with  F the  filter  generated  by  (yn  \ n e co) 
(i.e.  X e F iff  3 m e &>({y„  | m < n e co]  c X c y)), 

(M„,  e,  £/„)  = <Lf  [F],  e,  F fl  Lf  [F])  , and 
(A/j,  e,  Wp)  = (i,[F],eFni,[F]) 

for  some  £ and  q. 

20.21  Theorem  (Silver  [71b:  440];  Solovay).  If  = co\,  then  there  is  a n\ 
set  without  the  perfect  set  property. 
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Proof.  Adapting  the  argument  for  13.12,  the  idea  is  to  get  a 77)  set  of  unique  codes 
for  ordinals  less  than  [c/1.  For  such  an  ordinal  a,  there  is  an  xo  e a>coP\L[U~\  such 
that  Mxo  — ( co , Exo)  is  a well-ordered  set  with  ordertype  a.  To  specify  the  <L[uy 
least  such  xq  is  not  n),  but  it  turns  out  to  be  772'  to  say  of  some  x\  e 10  co  fi  L[U]: 
NXI  = ( N , e,  W)  where  the  latter  is  the  <iP-least  countable  iterable  premouse 
such  that  some  *0  e N is  the  <l[u]  \(N  x A)-least  possible  as  above.  Again,  to 
specify  the  <z.[£/]-least  such  x\  is  not  77,1,  but  it  can  be  similarly  specified  it  in 
terms  of  some  NX1,  and  so  forth.  The  resulting  (x,-  | i e co)  can  then  serve  as  a 
unique  code  for  a. 

For  any  x,  y e 01  co,  that  tr (Ny),  if  defined,  is  the  <ip-least  possible  countable 
iterable  premouse  containing  x is  equivalent  by  the  remark  preceding  the  theorem 
to:  Vlz(tr(Nz)  is  defined  and  is  a premouse  with  a countable  iterate  isomorphic 
to  a proper  initial  segment  of  a countable  iterate  of  tr (Ny)  ->  x ^ tr (IV-)),  noting 
that  the  antecedent  to  this  implication  implies  that  tr(Nz)  is  iterable. 

Recalling  the  relation  A of  the  proof  of  13.12  and  availing  ourselves  of  the 
77 j1  Z and  R of  the  proof  of  20.18,  we  are  led  to  the  following  relation  on  '"or. 

A(x)  **  M(X)0  is  well-ordered  A V°/[(x),-+i  e Z 

A tr (N(x)l+1)  is  iterable  A (x),  e tr(ATWi.+1) 

A e Z A R(a,b,z ) A R(c,  d,  (x),+i) 

A m is  the  maximum  element  of  Ea 
A n is  the  maximum  element  of  Ec 
A 3'/(/  codes  an  isomorphism  of  AT,*  into  a 
proper  initial  segment  of  Nr)  -r  (x),  £ tr(AA;/)] 

A VlyV0pV°q[Ji(x)l+.1(p)  — y A nMl+l(q)  = (x),- 
A (N(x)i+i\=<pi[p,q]  -a  (i  = 0 — >■  My¥MMo) 

A (i  > 0 Ny?  A7W())]] 

where  <p\  (vq.  i>i)  is  the  well-ordering  formula  of  3.3(b).  Now  getting  on  track  and 
following  the  13.12  argument,  it  is  straightforward  to  check  that  A is  a 77 ) set 
of  cardinality  |oq[ul|,  and  that  A does  not  have  a perfect  (even  uncountable  S[) 
subset.  H 

These  results  are  the  best  possible  in  the  projective  hierarchy  since  Solovay 
had  shown  that  if  there  is  a measurable  cardinal,  then  every  Li  set  is  Lebesgue 
measurable,  has  the  Baire  property,  and  has  the  perfect  set  property  (14.3, 14.10). 
Silver  had  been  motivated  in  part  by  this,  and  his  results  established  a new  de- 
limitation at  A3.  This  in  turn  led  to  an  investigation  of  stronger  large  cardinal 
principles  that  might  similarly  correlate  with  regularity  properties  higher  in  the 
projective  hierarchy.  Unlike  Gddel’s  hopes  for  the  Continuum  Problem  (§3),  the 
relative  concreteness  of  the  projective  hierarchy  afforded  the  possibility  of  a sys- 
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tematic  study,  one  that  was  to  meet  with  remarkable  success  by  the  late  1980’s 
(§32). 

The  concluding  results  of  this  section  concerns  other  large  cardinals  and  inner 
models  of  measurability.  Of  course,  in  the  presence  of  a measurable  cardinal  more 
modest  large  cardinals  exist  in  great  profusion  by  previous  reflection  results.  The 
Jonsson  and  Rowbottom  cardinals  (§8)  are  somewhat  anomalous  in  that  although 
they  have  substantial  consistency  strength  (2 1 .4)  they  themselves  may  not  be  very 
large.  Kunen  on  the  other  hand  established  that  in  the  k -model  these  cardinals 
coincide  with  the  ostensibly  stronger  Ramsey  cardinals.  Thus,  one  cannot  establish 
in  the  theory  ZFC  plus  there  is  a measurable  cardinal  that  there  are,  e.g.  singular 
Jonsson  cardinals.  An  outright  limitation  reminiscent  of  8.13  is  first  noted: 

20.22  Exercise.  Suppose  that  (L\U],  G,  U)  is  the  k -model  and  there  is  a Jonsson 
cardinal  X > k.  Then  V L\U]. 

Hint.  Working  in  L[U],  argue  by  contradiction  and  take  X to  be  the  least  Jonsson 
cardinal  > k.  By  8.14(c)  and  8.15(a)(b)  there  is  a regular  v satisfying  k < v < X, 
such  that  X -»  [A.]<®v.  By  8.4(b)  let 

( H , G,  U n H,  v n H)  -<  [Lx[U],  G,  U,  v) 

be  such  that  \H\  — X,  k + 1 c H,  and  |v  n H\  < v.  The  transitive  collapse  of 
{H,  G,  U n H)  is  again  (Lk[U],  G,  U),  so  if  i:  ( LX[U ],  G,  U)  < (Lx[U],  G,  U)  is 
the  inverse  of  the  collapsing  map,  crit(i)  is  a measurable  cardinal  greater  than  k 
in  L[U],  which  is  a contradiction.  H 

This  result  is  extended  by  21.23. 

20.23  Theorem  (Kunen  [70]).  Suppose  that  {L[U],  G,  U)  is  the  k -model for  some 
k.  Then 

L[U\  |=  VX(X  is  Jonsson  o-  X is  Ramsey)  . 

Proof.  Working  in  L[U],  suppose  that  X is  Jonsson,  and  /:  [A.]<<a  — > 2.  We 
must  find  a member  of  [k]1  homogeneous  for  /,  and  this  is  first  done  under  an 
additional  assumption: 

(*)  There  is  a S < X and  an  x c 8 such  that  / <l[u]  x ■ 

This  uses  arguments  employed  in  the  proof  of  GCH:  Noting  that  X < k by  20.22, 
there  is  an  iterable  premouse  ( N , g,  W)  at  some  p > 8 with  x e N and  \N\  = |<$|. 
If  (AT,  €,  Wx)  is  then  the  kth  iterate  of  ( N , G,  W)  with  jox  the  corresponding 
embedding,  jox(p)  — k and  x G Nx . By  Comparison,  let  (L[F],  G,  F)  and 
{L^[F],  G,fn  Lt; [F])  be  comparable  iterates  of  ( L[U ],  G,  U)  and  (Nx,  e,  Wx) 
respectively.  Since  / <l[u]  x,  by  elementarily,  / <l[f]  x.  Also  V(X)  fi  Nx  = 
V(X)  fl  Lf  [F],  so  that  x G Nx  implies  that  / G Nx-  But  X is  measurable  in  Nx, 
so  / has  a homogeneous  set  as  required. 
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The  general  case  yields  to  similar  arguments.  First,  there  is  an  iterable  pre- 
mouse ( M , e,  Y)  at  some  a > X with  / e M and  \M\  — X.  Let  g:  X ->  M be  a 
bijection.  Since  X is  Jonsson,  there  is  a proper  substructure 

(H,  ejnfl,  g\H,  {/}>  -<  (M,  e,  7,  g,  {/}> 
with  | // 1 — X.  Let  (T,  e,  Z,  g,  {/}>  be  the  transitive  collapse  and 
j:  {T,  e,  Z,g,  {/}>  -<  (M,  e,  Y,  g,  {/}} 

the  inverse  of  the  collapsing  map.  Using  g,  |/f  IT  A.|  = X yet  H (T  A.  ^ X,  so  that 
j(X)  — X yet  j has  a critical  point  S < X.  The  usual 

{X  e V(S)  DT  \ S e j(X)} 

cannot  witness  the  measurability  of  8 in  L[U],  so  there  must  be  an  x e V( S)  — T. 

It  is  clear  that  (T,  e,  Z>  is  an  iterable  premouse  at  X and  f e T,  so  an 
application  of  Comparison  as  before  shows  that  / <l[u]  x.  Finally,  by  the 
argument  from  (*)  there  is  an  S e [X]^  homogeneous  for  /.  But  then,  j“S  e [A]1 
is  homogeneous  for  /,  and  the  proof  is  complete.  H 

Kunen  [68]  generalized  his  structure  results  to  models  M = L[(Ua  \ ff<y>] 
where  in  M,  Ua  is  a normal  ultrafilter  over  some  Xa,  a < /3  < y implies  that 
Xa  < Xp,  and  the  sequence  is  short:  y < Xq.  For  example,  he  showed  that  in  M 
the  Ua  s are  the  only  normal  ultrafilters  (over  any  cardinal).  Silver  [71b]  showed 
that  M satisfies  GCH,  and  there  is  a A\  well-ordering  of  its  reals.  Far  beyond 
this  first  generalization,  inner  model  theory  was  to  develop  into  a sophisticated 
area  of  set  theory  in  the  hands  of  Mitchell,  Jensen,  Dodd,  Steel,  and  others, 
encompassing  stronger  and  stronger  hypotheses  and  providing  forceful  arguments 
for  their  consistency. 
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21.  Embeddings,  0#,  and  0^ 

Iterated  ultrapowers  is  not  only  the  appropriate  technique  for  investigating  in- 
ner models,  but  also  their  embeddings.  Using  it  Kunen  established  an  elegant 
characterization  of  0#:  it  exists  exactly  when  there  is  an  elementary  embedding: 
L < L.  Reminiscent  of  the  elementary  embedding  characterization  of  measurable 
cardinals,  the  initial  isolation  of  0#  was  bolstered  by  Kunen’ s result.  Moreover, 
it  provided  a simple  paradigm  for  transcendence  over  inner  models  M in  gen- 
eral in  terms  of  their  non-rigidity,  i.e.  the  existence  of  an  elementary  embedding: 
M < M.  For  inner  models  of  measurability  Solovay  had  already  isolated  a real 
0^  analogous  to  0#  for  L.  The  existence  of  0^  is  a canonical  principle  for  tran- 
scendence over  all  inner  models  of  measurability  and  also  has  characterizations  in 
terms  of  their  non-rigidity.  This  section  explores  embeddings  of  L and  develops 
the  theory  of  0 ' in  considerable  detail. 

The  following  theorem  analyzes  Kunen’ s result  about  the  non-rigidity  of  L 
through  several  equivalences.  Strictly  speaking,  the  quantification  3 j(j:  L -<  L) 
cannot  be  formalized,  and  (b)  articulates  what  is  needed. 

21.1  Theorem  (Kunen).  The  following  are  equivalent: 

(a)  There  is  an  elementary  embedding:  L -<  L. 

(b)  For  some  a and  /3  there  is  an  elementary  embedding:  La  -<  Lp  with  a 
critical  point  less  than  |a|. 

(c)  There  is  an  L-ultrafilter  U such  that  the  ultrapower  of  L by  U is  well- 
founded. 

(d)  There  is  an  iterable  L-ultrafilter. 

(e)  0#  exists. 

Proof,  (d)  — >•  (e)  The  domain  of  each  iterate  is  again  L by  elementarity,  so  19.10 
can  be  applied  to  get  an  uncountable  set  of  indiscernibles  for  some  La. 

(e)  -*  (a)  This  is  9.17(a). 

(a)  ->  (b)  Suppose  that  k:  L -<  L with  critical  point  k.  For  any  a with 
k < | cr |,  the  satisfaction  relation  for  La  can  be  used  to  show  that  k\La  satisfies 
(b). 

(b)  — > (c)  Let  j be  an  elementary  embedding  as  hypothesized,  say  with 
critical  point  k,  and  consider  the  usual 

U = {X  eV(K)nLa  | Kej(X)}. 

T(k ) nLCL„  since  a > k+l,  so  U is  an  L-ultrafilter  (cf.  19.2).  It  remains  to 
verify  that  the  ultrapower  of  L by  U is  well-founded: 

Assume  to  the  contrary  that  for  each  n e <x>  there  is  a function  /„  e L with 
domain  k such  that 


Xn  — {£  < K I ,/n+l  (?)  £ //?(£)}  ^ U . 
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Let  y > k be  a limit  ordinal  such  that  {/„  | n e w)  c Ly,  H -<  Ly  a Skolem  hull 
of  k U {/„  | n e w},  and  N its  transitive  collapse  with  jt:  H ->  N the  collapsing 
isomorphism.  Then  N = L$  for  some  S < a (this  is  where  a > k+  is  needed, 
not  just  a > k+l).  Letting  Jt(fn)  — fn  for  new  and  noting  that  tt(|)  = § for 
§ < k,  for  such  n 


Xn={S<K  I /B+1($)  e /„($)}  e £/  . 

Since  {/„  | n e w}  c La,  it  follows  from  the  definition  of  U that 

j(fn+  l)W  e j(f„)(.K) 

for  every  n e cu,  which  is  a contradiction. 

(c)  ->  (d)  Suppose  that  U is  an  L-ultrafilter  with  corresponding  iteration 


(Ma,  Ua,  iaft) a<fiez 

where  r > 1;  note  that  each  Ma  = L.  It  will  be  established  that  in  fact  r = On. 
This  is  the  heart  of  the  matter,  and  uses  the  canonicity  of  L to  derive  full  iterability 
from  just  the  well-foundedness  of  the  first  ultrapower.  The  following  two  lemmata, 
showing  successively  that  r is  neither  a successor  ordinal  nor  a limit  ordinal, 
complete  the  proof  of  the  theorem. 

21.2  Lemma,  r is  not  a successor  ordinal. 

Proof.  Assuming  that  a < r we  must  show  that  a + 1 < r,  i.e.  the  ultrapower  of 
Ma  by  Ua  is  well-founded.  By  19.6, 

(*)  Ma  — L — {ioa(g)(Kn,  ■■■,Kyn)  I 

g e m"L  fl  L for  some  n e w and  y\  < . . . < y„  < a}  . 

Since  r > 1,  Mi  = KL/U  is  again  L,  so  a g as  in  (*)  is  of  form  g — [f]u,  where 
it  can  be  assumed  that  for  every  f < k,  f(f  ) is  a function  with  domain  [|]".  For 
such  /,  define  /:  [/c]"  ->  KL  by: 

1 /(§)«!,...  ,§„)  if  <...<!„<?,  and 
J vSl » • • • » ?n)(5)  — | 

l 0 otherwise  . 

Incorporating  this  into  the  representation  (*)  of  L,  we  show  that  the  map 
e:(L,e)  ->  {K“Ma/Ua,  EUa)  given  by: 

e(ioa([f]u)(Kyi,  ■ ■ ■ , Ky„))  = O'oa (/)(%,  • • • , Ky„))°Ua 

is  a well-defined  isomorphism,  thereby  confirming  that  the  latter  structure  is  well- 
founded: 

Taking  for  simplicity  a one-variable  formula  <p(v o)  of  C€,  note  that  as  a 
schema, 
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(T , t=)  \~  ([/"]t/ ) i^yi » • • • > ^K„)] 

iff  (L,  e,  £/>  N {{?i,  e M"  I , !„)]}  e t/"  (by  19.9(a)) 

# (i,  e,  £/>  N {{§1,  e M"  I {$  < * I ?[7«i,  - ■ . &)($)]}  e U)  e U'! 

# (Mb,  e,  £/„>  N {£  < Ka  | v[ito(7)(^.  • • ■ - *»)($)]}  e £/“  (by  19.9(a)) 
iff  {MaK“ /Ua,  EUa)  \=  <p[(ioa(7)(Kn,  ■ ■ ■ , /cy„))°J  . 

Using  atomic  formulas  <p,  it  is  simple  to  check  that  e is  well-defined  (i.e.  e(x)  for 
x e L does  not  depend  on  the  representation  of  x),  that  it  is  injective,  and  that 
it  preserves  membership.  Also,  e is  surjective:  every  member  of  MaKa  /Ua  is  of 
form  (ioc(f)(Kn, . . . , KYn))y  for  some  appropriate  / and  y\  < ...  < y„  < a. 
Hence,  e is  an  isomorphism  as  desired.  H 

21.3  Lemma,  r is  not  a limit  ordinal. 

Proof.  Assuming  that  S < x is  a limit  ordinal  we  must  show  that  S < r,  i.e.  that 
the  direct  limit  of 


(«Ma,  e,  Ua)  | a < S),  (iaP  | a < P < 8)) 

is  well-founded.  This  is  done  by  developing  a different  representation  of  the 
iteration  based  on  Skolem  hulls  constructed  uniformly  in  L.  In  order  to  formalize 
the  procedure  in  ZFC  generated  Xj  -elementary  substructures  are  used,  in  keeping 
with  the  convention  based  on  5.1(c)  that  elementary  embeddings  between  proper 
classes  are  formalized  as  ■£)  -elementary  embeddings. 

Recall  the  canonical  Skolem  terms  tv  for  <p  a formula  of  as  defined  after 
9.3.  Let  T consist  of  the  terms  in  the  closure  under  term  composition  of  the  tv' s for 
(p  a Xi  formula.  Since  the  satisfaction  relation  \=XL  for  L restricted  to  Xj  formulas 
is  definable  in  ZFC  (§0),  the  collection  of  L interpretations  {tL  \ t e T },  properly 
coded,  is  definable  in  ZFC.  Consequently,  we  can  uniformly  define  for  any  X c L 
the  Xi  Skolem  hull  ofX  in  L,  denoted  H^(X),  to  be  that  e -substructure  of  L with 
domain 

{tL(x i, . . . , xm)  | t e T is  w-ary  and  x\, . . . , xm  e X)  . 

H^(X)  is  extensional,  since  if  a,b  e H^(X)  and  3v(.r  e a A b),  then  Bx  e 
H{-(X)(x  e aAb).  It  follows  (0.4)  that  H^(X)  has  a transitive  collapse  N(X),  and 
by  definition  of  T the  inverse  of  the  collapsing  isomorphism  is  a map  ex'-  N(X)  -<i 
L,  which  by  5.1(c)  also  satisfies  ex'-  N(X)  -<„  L for  each  particular  n.  As  N(X) 
satisfies  the  sentence  ero  of  3.3(a),  it  follows  that  N(X ) = L for  proper  classes  X. 

Proceeding  with  the  proof,  set 

H — {x  e L | Va  < S(ioa(x)  — x)}  . 

Then  k c H and  H is  a proper  class.  Stipulate  for  a < S that 
Ha  = H1L(HU{Ky  I y < a})  , 

Na  is  its  transitive  collapse  , and 
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ea\  Na  < L the  inverse  of  the  collapsing  isomorphism  . 

Then  it  is  simple  to  see  that  H0  = H,  and  by  previous  remarks,  that  each  Na  = L. 
For  a < ft  < 8,  keeping  in  mind  that  I Ia  C H p , set 

— 6p  O L -<|  L . 

Then  the  system  of  embeddings  (eap  \ a < ft  < 8)  forms  a directed  system. 
Moreover,  for  any  limit  ordinal  y with  0 < y < 8,  Hy  — Ua<y  Ha  and  so 
Ny  — L is  the  direct  limit  of  {eaP  \ a < ft  < y).  Since  this  holds  for  y = 8 itself, 
it  suffices  to  show  that  iap  = eap  for  a < ft  < 8 to  complete  the  proof. 

We  first  establish  the  following  for  a < ft  <8: 

(i)  Ka  c Ha  . 

(ii)  For  any  X e V{ku)  n L,  iap(X)  = eap(X ) n kp  . 

(hi)  crit(e^)  = Ka  and  eap(Ka)  = kp  . 

For  (i),  let  § < Ka  be  arbitrary.  Then  by  19.6,  | = ioaif)(KYl,  . . . , kYii) 
for  some  /:  [/c]"  — > k with  f e L and  y\  < ...  < yn  < a.  But 
/ = e0(f)\[Kf  since  k c H,  and  so  i0a(f)  = e0( f)\[K]n  since  e0(f)  e H. 
Flence,  f = e0  (f)(/cyi, KyJ  e Ha. 

For  (ii),  note  first  that  X = ea(X)  fi/Cc  since  ea  fixes  every  member  of  Ka  by 
(i).  Flence, 


iafi  (-^0  — iafi  Fl  Ka)  — ^a(^0  Fl  Kp  — eap(X)  fi  Kp  . 

The  second  equality  follows  from  the  fact  that  iap  fixes  every  member  of  H U 
{at,,  | i]  < a}  and  hence  of  Ha , and  the  third  equality,  that  ep  fixes  every  member 
of  Kp  by  (i)  for  /3. 

(iii)  follows  from  (i)  and  by  taking  X — Ka  in  (ii). 

We  can  now  proceed  to  establish  iap  — eap  for  a < ft  < 8 by  induction  on 
yS : The  limit  case  is  immediate  since  direct  limits  are  taken.  If  ft  — a + 1,  only 
iff.a+i  = ea,a+i  need  be  verified.  For  this  purpose,  note  that 

Ma+l  — L — { ia,a+x{f)(Ka ) \ f € * L fl  L) 

by  the  usual  representation  of  ultrapowers.  We  shall  show  that  the  map  k:  Ma+\  — »■ 
Na+ 1 given  by 

k(ia,a+l  (y) (^a))  — ^a,ct+\ (.f)(^-a) 

is  an  isomorphism  between  Ma+\  = L and  Na+\  — L.  This  more  than  suffices, 
for  then  k must  be  the  identity,  and  so  for  any  x e L, 


ia,a+l(*0  — fa.o'+l  (fx){Ka)  — ^o'.Q'+l  (.fx){Ka)  — ^a,a+ 1 (-*■) 


where  fx  is  the  constant  function:  Ka  ->  {.*}. 
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An  appeal  to  (ii)  shows  that  k is  a well-defined  injection: 

ia,a+lif)i^a)  = ia,a+\ is)iKa)  Iff  Ka  G 1q'.q'+i({^  < Ka  \ fit;}  = g(^)l) 

iff  Ka  & ea,a+l{{H  < Ka  I /($)  = *(£)}) 
iff  ea,a+i (.f)(Ka)  = ea,a+\ (g)(Ka)  . 

Clearly,  = can  be  replaced  by  e throughout. 

Finally,  it  remains  to  show  that  k is  surjective.  Suppose  that  y e Na+ i.  Then 
ea+fy)  e Ha+U  and  so 

ea+\ (v)  = tL (xi , . . . , xm , Kn KVn , Ka) 

for  some  t e T (as  defined  at  the  beginning  of  the  proof),  x\, xm  e H,  and 
Y\  < ■ ■ ■ < Yn  < «•  Let  / e KaL  n L be  defined  by: 

fi%)  =tL(Xi,...,Xm,KYl,...,ICyn,Z)  . 

Then 

^a.a-t-l  if)i^a)  = ^a^-ii^aif)i^a)} 

= 1 (t  ix  1 , . . . , Xffj  , /Cyj  , . . . , Kyn,  Ka)) 

= y ■ 

We  should  justify  the  middle  equality  because  of  the  formalization  through  Hj- 
elementary  embeddings: 

Suppose  that  the  well-ordering  formula  <Poivo,  tq)  of  3.3(a)  is  Ek  where 
k > 0.  (Actually,  (po  can  be  required  to  be  Xi;  see  Devlin  [84:75].)  Then 
for  any  t(v\, . . . ,vm)  e T,  vq  — t(v  \,...,vm)  is  definable  by  a Ek+ 1 for- 
mula: Proceeding  by  induction  on  complexity  of  term  composition,  if  t is  a 
canonical  Skolem  term  where  (p  is  X),  then  the  definition  after  9.3  is  a 
Boolean  combination  (Xi  A Ilf)  A TI\,  and  hence  is  Ek+\.  Suppose  now  that 
t e T is  of  form  t^(/i(tq, . . . , vm), . . . , tr(v i, . . . , v,„))  where  <p  is  E\  and  induc- 
tively each  Do  = t,(iq, . . . , vm)  has  a X^+i  definition  tpfv  o,  tq, . . . , vm ).  Then 
v0  = t( tq, vm)  iff 

3w\  . . .3wr[(p\iw\,v\ ,vm)  A... 

A <pr(wr,  Vi, Vm)  A Vo  — tv{w i,  . . . , Wr ))  . 

The  last  conjunct  having  a Ek+i  definition  by  the  previous  argument,  the  whole 
formula  can  be  recast  as  a Ek+\  formula  by  “pushing”  existential  quantifiers  to 
the  left. 

Finally,  ea,a+i  is  Xj.+2-elementary  for  our  particular  k by  5.1(c),  so  applying 

it  to 

< Kaifif)  = tL(x i,  . . ,,Xm,Kn, %,,£)) 

yt;  A /Cq'+I  i^a,a+ 1 (./)  if)  — t iX\  , . . . , Xm  , KYl , . . . , Kyn  , i;)) 


results  in 
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since  ea:Cl+1  (k„)  — Ka+ \ by  (iii)  andx,,  ...,xm,Kyi, KYn  are  all  fixed  by  ea,a+i . 

In  particular,  we  have  the  desired  justification 

^a,a+\  (/")  = t (Xj , . . . , Xm  , KYi , . . . , Kyn  , Kf)  . 

This  completes  the  proof  of  21.3,  and  with  it,  the  theorem.  H 

Proofs  of  21.1  have  been  devised  by  Silver  (see  Devlin  [84:  V §4])  and  Paris 
(see  Kanamori-Magidor  [78:  §10])  that  generate  indiscemibles  directly  without 
relying  on  iterated  ultrapowers.  Parsimony  may  be  appropriate  were  the  theorem 
an  end  in  itself,  but  in  volume  II  generalizations  will  be  considered  for  which  the 
richer  context  of  iterated  ultrapowers  provides  the  natural  setting. 

As  foreshadowed,  we  can  now  draw  a stronger  conclusion  from  model- 
theoretic  hypotheses  discussed  in  §8: 

21.4  Corollary  (Kunen).  If  Chang’s  Conjecture  holds  or  there  is  a Jonsson  cardi- 
nal, then  0#  exists. 

Proof.  These  hypotheses  imply  21.1(b)  by  the  proofs  of  8.3  and  8.13  respectively. 

H 


There  is  another  consequence,  about  the  closed  unbounded  filter  being  an 
ultrafilter  in  a restricted  sense: 

21.5  Exercise.  The  following  are  equivalent  for  regular  k > w: 

(a)  0#  exists. 

(b)  Every  X e V{k)C\L  either  contains  or  is  disjoint  from  a closed  unbounded 
subset  of  k. 

Hint.  For  (a)  -»  (b),  suppose  that  X = tL(x  i, , xm,  y\,  y,j)  for  a Skolem 
term  t and  x\  < . . . < xm  < y\  < . . . < yn  all  in  /,  the  class  of  0#  indiscemibles, 
withx,,,  < k <y \.  Show  that  X either  contains  oris  disjoint  from  IC\(k  — (xm  + l)). 
For  (b)  — > (a),  produce  a k:  L < L.  H 

Finally,  as  noted  in  the  proof  of  9.17(a)  any  order-preserving  injection  from 
the  class  of  indiscemibles  given  by  0#  into  itself  induces  an  elementary  embedding: 
L -<  L via  Skolem  terms.  This  is  how  all  elementary  embeddings:  L < L are 
generated: 

21.6  Proposition.  Suppose  that  0#  exists,  /:  L < L,  and  I is  the  class  of  indis- 
cernibles  given  by  0#.  Then  crit(y)  e I.  Also,  j\I:  I — > I,  and  hence  j\I  induces 
j via  Skolem  terms. 

Proof.  First  observe  that  for  any  k:  L < L,  crit(k)  e I:  Let  8 — crit(k),  and 
assume  to  the  contrary  that  8 £ I.  Then 

8 = tL(x i, xm,  yi, ....  yn) 
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for  some  Skolem  term  t and  0#  indiscemibles  X\  < ...  < xm  < yi . . . < yn  with 
xm  < 8 < y\ . It  can  be  assumed  that  there  is  a proper  class  of  cardinals  fixed  by 
k.  (If  not,  switch  from  k to  ju:  L -<  L =SL/U  where  U is  the  L-ultrafilter  over 
S defined  by:  X e U iff  X e V(8)nL  A 8 e k(X)  as  in  the  proof  of  21.1,  as  ju 
also  has  critical  point  8.)  But  then,  by  the  0#  condition  (III)  it  can  be  assumed  that 
yi,  . . . , y„  are  all  fixed  by  k.  Hence,  being  definable  in  terms  of  fixed  parameters 
we  have  k(8)  = 8,  which  is  a contradiction. 

It  remains  to  show  for  the  given  j that  if  ( e /,  then  j(i)  e I.  For  this 
purpose,  it  suffices  by  the  above  observation  to  show  that  j (i)  is  the  critical 
point  of  some  k:  L -<  L.  Let  H be  a Skolem  hull  of  j (i)  U j“(I  — (i  + 1)). 
(This  is  directly  formalizable,  since  the  satisfaction  relation  for  L is  definable  in 
the  presence  of  0#.)  The  transitive  collapse  of  H is  L,  and  the  inverse  of  the 
collapsing  isomorphism  is  an  elementary  embedding  k:  L -<  L.  It  will  be  shown 
that  crit(£)  = j (i);  since  j(i)  c H,  this  reduces  to  checking  that  j (i)  ^ H: 
Assume  to  the  contrary  that  there  are  r] \ < ...  < r)m  < j (i)  such  that 

j (0  = tL(m,  ■■■,  Vm,  j(y\),  • • ■ - j(yn)) 

for  some  Skolem  term  t and  i < y i < ...  < yn  all  in  I . Then  by  the  elementarity 
of  /',  there  are  f i < . . . < < ( such  that 

L — i (^1  ’ • • ■ fm » 3^1 » ■ • • ’ yf)  • 

However,  by  simple  applications  of  the  0#  condition  (III)  each  f is  definable  in 
terms  of  members  of  I — {(},  and  hence  i itself  is  definable  in  terms  of  other 
indiscemibles.  This  is  a contradiction.  H 

The  Set 

Having  developed  a detailed  analysis  of  transcendence  over  L,  we  next  consider 
a canonical  formulation  of  transcendence  over  inner  models  of  measurability.  If 
{L[£/],  e,  U)  is  the  k -model  for  some  ordinal  k,  by  the  remarks  at  the  end  of 
§9  (slightly  amended  to  allow  U c V{k))  there  exists  under  sufficient  assump- 
tions a set  U#  c k analogous  to  0#  that  generates  a closed  unbounded  class  of 
indiscemibles  for  (L\U\,  e,  U,  However,  because  the  k -models  for  various 

k are  merely  iterates  of  each  other,  one  might  expect  a unifying  transcendence 
principle.  In  fact,  soon  after  the  isolation  of  0#  Solovay  formulated  such  a prin- 
ciple: the  existence  of  the  set  of  integers  O'  (“zero  dagger”).  The  theory  of  0^  is 
developed  in  considerable  detail  as  in  Kanamori-Awerbuch-Friedlander  [90],  first 
following  the  analysis  leading  to  0#  in  §9  and  then  providing  non-rigidity  and 
other  characterizations. 

The  idea  behind  0^  is  to  develop  a canonical  theory  for  structures  of  form 

(Lf  [t/],  e,  JJ)  \=U  is  a normal  ultrafilter  over  k 

with  two  sets  of  indiscemibles,  one  below  k and  one  above,  that  together  generate 
the  structure.  For  M a structure  and  X and  Y subsets  of  the  domain  of  M so  that 
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X U Y is  linearly  ordered  by  a relation  <,  ( X , Y,  <)  (or  in  context,  just  (X,  Y))  is 
a double  set  of  indiscernibles  for  M iff  for  every  formula  cp(v\, . . . , u„+J)  in  the 
language  of  M;  x\  < ...  < x„  and  x\  < . . . < xn  all  in  X\  and  yi  < ...  < ys 
and  y j < . . . < ys  all  in  Y, 

M\=(p[xi,...,xn,yi,...,ys\  iff  M (=  cp\xx, . . . , xn,  ylt . . . , ys]  . 

Let  £e(A)**  be  £€(A)  augmented  by  constants  {cu  \ k e co}  U {dk  \ k e co}. 
An  EM2  blueprint  is  the  theory  in  £g(A)**  of  some  structure 

(L^-[t/],  G,  U , Xk,  yk)kea> 

where  f is  a limit  ordinal  greater  than  a>;  for  some  ordinal  k,  (L( [£/],  e,  U)  U 
is  a normal  ultrafilter  over  k;  and  ({Xk  \ k e co},  {yk  \ k e co})  is  a double  set  of 
ordinal  indiscernibles  for  (Lf[C/],  e,  U)  such  that  for  every  k e co, 


xk  < xk+i  < k <yk  < yk+ 1 . 

The  canonical  Skolem  terms  tv  for  cp  a formula  of  £e(A)  and  corresponding 
Skolem  hulls  were  formulated  before  20.9. 

By  Skolem  term  is  meant  one  of  these  tf  s for  the  rest  of  this  section  . 

We  can  suppose  that  if  a structure  (M,  E,  R)  is  elementarily  equivalent  to  one  of 
form  (La[A],  e,  A)  for  some  limit  ordinal  S > oo,  then  for  any  X c M the  Skolem 
hull  of  X in  (M,  E,  R)  is  well-defined  and  given  by  these  Skolem  terms. 

The  following  is  the  analogue  of  9.4.  For  a theory  T in  £e(A)**,  temporar- 
ily let  T~  denote  its  restriction  to  £g(A).  Note  for  here  and  later  that  if  A is 
interpreted  by  a normal  ultrafilter  in  a structure,  then  1J  A is  a way  of  denoting 
the  corresponding  measurable  cardinal  in  the  structure. 

21.7  Exercise.  Suppose  that  T is  an  EM2  blueprint.  Then  for  any  a and  y , there 
is  a model  Ai  = A4(T,  a,  y)  of  T~  unique  up  to  isomorphism  such  that: 

(a)  There  is  a double  set  {X,  Y)  of  indiscernibles  for  A4  with  lUf  C OnM, 
X of  ordertype  a and  Y of  ordertvpe  y under  <M,  and  x <M  (J  AM  <M  y for 
evety  x e X and  y e Y.  Moreover,  for  any  formula  <p(v i, . . . , v„+s)  of  £e(A), 
xi  <M  . . . <M  xn  all  in  X,  and  y\  <M  . . . <M  ys  all  in  Y, 

M |=  <p[x i xn,  y i ys ] iff  cp(c0, ...,  c„-i,d0, ds- i)  e T . 

(b)  The  Skolem  hull  of  X U Y in  Ai  is  again  A4.  H 

If  A i(T,a,y)  is  well-founded,  then  its  transitive  collapse  is  of  form 
(Ls[U],  €,  U)  for  some  limit  ordinal  S > co.  In  this  case, 

A i(T,  a,  y)  is  identified  with  (Lj[f/],  e,  U)  . 

The  following  is  the  analogue  of  9.5. 
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21.8  Exercise.  Suppose  that  T is  an  EM2  blueprint.  Then  M.(T,  or,  y)  is  well- 
founded  for  eve?y  a,  y iff 

(I)  M{T , a.  y)  is  well-founded  for  every  a,  y < u>\  . ~\ 

A sufficient  hypothesis  leads  quickly  to  EM2  blueprints  with  potent  properties; 
the  following  is  an  analogue  of  9.6. 

21.9  Lemma.  Suppose  that  there  is  a K-model  for  some  k and  a Ramsey  cardinal 
greater  than  k.  Then  there  is  an  EM2  blueprint  satisfying  (I)  of  21.8. 

Proof.  Let  (L[U],  e,  U)  be  the  k -model,  (L[Ua],  Ua,  Ka,  iap)a<pe On  its  iteration, 
and  v a Ramsey  cardinal  greater  than  k.  By  the  argument  for  9.3,  there  is  a set  of 
indiscemibles  H e [v  — k]oj'  for  the  structure  (LV[U],  e,  U).  Set  X — {Ka  \ a < 
Co i}  and  Y — iom,“H.  Then  straightforward  uses  of  19.9(a)  with  ioOJl  (v)  — v as  a 
parameter  show  that  ( X , Y)  is  a double  set  of  indiscemibles  for  {Lv[U0Jl ].  e,  t/Wi). 
Hence,  (X,  Y)  determines  an  EM2  blueprint,  and  since  X and  Y are  uncountable, 
this  EM2  blueprint  satisfies  (I)  by  the  argument  for  9.6.  H 

By  20.22,  the  hypothesis  of  21.9  already  implies  that  V is  not  the  k -model 
L\U\.  Assuming  that  hypothesis,  we  can  deduce  the  existence  of  an  EM2  blueprint 
fully  analogous  to  0#.  Evidently,  a weaker  partition  property  than  Ramsey  suffices 
for  the  proof  of  21.9;  in  any  case,  on  the  basis  of  that  proof,  stipulate  that 

(i)  p < t;  are  limit  ordinals  and  (Lf  [t/],  e,U)  f=  £/  is  a normal 
ultrafilter  over  p; 

(ii)  (A,  Y)  is  a double  set  of  indiscemibles  with  X and  Y both  of 
ordertype  &>i  and  sup(A)  = p; 

(iii)  £ is  the  least  possible  satisfying  (i)  and  (ii); 

(iv)  X and  Y have  the  least  possible  &>th  elements;  and 

(v)  T()  is  the  corresponding  EM2  blueprint. 

21.10  Exercise.  The  following  conditions  hold  for  T = Tq: 

(Ila)  For  any  ( n + s)-ai y Skolem  term  t,  T contains  the  sentence: 

r(c0, . . . , c„_i,  do ds- 1)  eU^  f(co c„_i,  d0 i)  < c„  . 

(lib)  For  any  ( n + s)-ary  Skolem  term  t,  T contains  the  sentence: 

t (co,  • • • ? cn—\ , do, . . . , ds—\)  G On  > t (co , . . . , cn— i , do , . . . , d , i ) < ds  . 
(Ilia)  For  any  ( m + n + s + 1 )-ary  Skolem  term  t,  T contains  the  sentence: 

t (cq  , . . . , c,u  i n , do , , ds—  i ) < cm  ^ t (co , . . . , cm+n , do , ... . d.  i ) — 
t(c0,  ■ . • 5 Cm— 1»  ^m+n+1?  • • • •>  ^m+2n+l?  ^0?  • • • 5 ^*s—  l)  • 

(Illb)  For  any  (n  + r + s -1-  1 )-ary  Skolem  term  t,  T contains  the  sentence: 

t(.CQ, . . . , Cfi—i , do, . . . , < dr  y t(c0,  • • • 5 Cn—\i  • • • 5 ^r+5)  = 

^(^0?  • • • > Cn— 1 1 do,  • • • , dr— 1,  dr- 1-5+I,  • • • , dr-\-2s+\ ) • 
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Hint.  A simple  argument  by  contradiction  establishes  (Ha)  from  sup(A)  = p and 
the  argument  for  9.8  establishes  (lib)  by  the  minimality  of  The  argument  for 
9.10  establishes  (Ilia)  and  (Illb)  from  the  minimality  of  the  <wth  elements  of  X 
and  Y respectively.  H 

For  an  EM2  blueprint  satisfying  (I)  and  for  any  a,  y,  temporarily  let 

«X,T’a’y  !»?<«),  <4’a’r  I $ < Y))  and  xTaa-Y 

denote  the  double  set  of  indiscemibles  and  the  measurable  cardinal  of 
M (T,  a,  y)  respectively.  The  / notation  anticipates  the  following: 

21.11  Lemma.  Suppose  that  T is  an  EM2  blueprint  satisfying  (I)-(III).  Then  for 
any  fi,  8: 

(a)  {xff^'&  \ p < ft}  is  a closed  set  of  ordinals,  unbounded  in  Xp’^S  if  P A 
an  infinite  limit  ordinal. 

(b)  If  co  < a < ft  and  co  < y < 8 with  a and  y limit  ordinals,  then  for  every 
r]  < a, 

y T.a.y  T.P.8 

At]  A t) 

Proof,  (a)  (Ila)  implies  that  the  set  is  unbounded  in  if  ft  is  a limit  ordinal. 

(Ilia)  implies  that  the  set  is  closed  (by  the  argument  for  9.11,  also  used  below), 
(b)  Let  TL  be  the  Skolem  hull  of 

{X™'S  I r,  < a]  U [tT^’S  | f < y} 

in  MiT,  8).  Its  transitive  collapse  is  M(T,a,y)  by  uniqueness.  We  shall 
show  that  xf'^’S  is  a subset  of  the  domain  of  H.  This  suffices,  since  the  collapsing 
isomorphism  consequently  fixes  every  member  of  {xff^'&  \ p < a}  making  this 
set  the  lower  set  of  the  double  set  of  indiscemibles  for  MiT , a , y)  (and  thus  also 
Xa' a'r  — Xa'^'S  by  (a)  for  a as  well  as  for  f). 

To  show  that  xf’^'S  is  a subset  of  the  domain  of  H,  let  a < xf'^’S  be  arbitrary. 
Suppressing  the  superscript  T ^-s  from  the  indiscemibles  for  convenience, 

= tM{TM(xm, . . . , Xft.  • • • - t{,-i) 

for  some  Skolem  term  t and  the  indiscemibles  listed  in  increasing  order  with 
pm-\  < a < (o-  Applying  (Illb)  with  r — 0,  f,-  can  be  replaced  by  i for  i < s, 
and  then  applying  (Ilia),  can  be  replaced  by  pm-\  + i for  i < n + 1.  Since  a 
and  y are  limit  ordinals,  the  resulting  expression  shows  that  a is  in  the  domain 
of  H.  H 

21.12  Lemma.  Suppose  that  T is  an  EM2  blueprint  satisfying  (I)-(III),  co  < y < S, 
and  y and  a are  limit  ordinals  (allowing  a — 0).  Then  with  M(T,  a,  8)  = 
(L^[D],  e,  D)  say,  the  transitive  collapse  of  the  Skolem  hull  of 

{xt'a’S  \p<a}U  | I <y} 
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in  M(T,  a,  8)  is  (L,[D\,  e,  D fl  L\D~\),  where  i — ily'a'S.  Consequently, 

M{T , a,  y)  — (Ll\D\,  e,  D n L,[D ]}  and  iTfa'Y  — i^’a,S  for  every  f < y . 

Proof.  Let  Pi  be  the  stated  Skolem  hull.  Then  its  transitive  collapse  is  A4(T,  a,  y) 
by  uniqueness.  By  the  argument  for  21.11(b)  with  a = /3  (and  a simple  version 
for  a — 0),  Xa'a’S 's  a subset  of  the  domain  of  Ti,  so  that  M(T , a , y)  must  be  of 
form  {/,,[£)],  e,  D D L,[D])  for  some  i.  Checking  that  l — iy’a,s  by  the  argument 
for  9.1 1,  the  proof  is  complete.  H 

By  21.11  and  21.12,  if  T is  an  EM2  blueprint  satisfying  (I)-(III),  then  for  any 
q,  f , and  a with  a a limit  ordinal  (possibly  0),  we  can  unambiguously  set 

y2  = xj'^'y  for  any  limit  ordinals  ft  and  y with  /l  > q , 
il’a  = iTfa’Y  for  any  limit  ordinal  y > f , and 
YT'a  = | § e On}  . 

Finally,  stipulate  that  if  a is  a limit  ordinal, 

Dtu  is  the  normal  ultrafilter  over  x„  in  the  sense  of  M(T , a,  a> ) . 

The  following  is  the  analogue  of  9.12. 

21.13  Exercise.  Suppose  that  T is  an  EM2  blueprint  satisfying  (I)-(III)  and  a is 
a limit  ordinal.  Then: 

(a)  (L[Dy],  e,  D^)  is  the  xi~m°del,  and  for  f f 

(Lr,[DTal  e,  Dl)  -<  (LrfDll  e,  DTa)  . 

(b)  \XTn  I = \q\  + K0  for  evety  q,  and  \i^’a\  — |£  | + \ot\for  every  f. 

(c)  (x,f  I h e On}  and  YT'a  are  closed  unbounded  classes  of  ordinals. 

(d)  For  any  cardinal  X > a>,  Xx  — X and  if  X > a,  LTk'a  — X and  so 
M(T,a,X)  = {Lx[Dl],e,DTa). 

(e)  T is  the  only  EM2  blueprint  satisfying  (I)— (III). 

Hint.  For  (a),  note  that  21.12  implies  that  there  is  a D such  that 
M(T,  a,  y)  = (L /,«[£>],  6,00  Lt„[D ]) 

for  any  limit  ordinal  y > a>.  Hence,  (f[l)].  e,  D)  is  the  xj  -model.  By  indis- 
cemibility,  (xJ)+L[D]  < f 'U  for  any  £.  In  particular,  D C Lt.c,[D]  by  20.2(a), 
and  so  D = DJa.  The  rest  of  (a)  and  the  exercise  is  just  like  9.12.  H 

We  shall  soon  derive  more  information  about  the  x,f’s  and  iy'"’s,  incorpo- 
rating successor  a’s  into  the  scheme  using  iterated  ultrapowers.  As  with  0#,  the 
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hypothesis  of  21.13  implies  through  its  (a)  and  (d)  that  for  any  infinite  limit  ordinal 

a, 

the  satisfaction  relation  for  (L[Z)J],  e,  D is  definable  in  ZFC  . 

Because  of  this,  various  upcoming  assertions  like  the  following  about  inner  models 
are  directly  formalizable. 

21.14  Lemma.  For  any  limit  ordinal  a,  ({/J  | rj  < a},  YT'a)  is  a double  class  of 

indiscernibles  for  the  /J -model  and  the  Skolem  hull  of  [x,'  | rj  < a]  U YT’a  in  the 
model  is  again  the  model.  ~\ 

With  21.13(e)  in  hand,  stipulate  that 

0^  is  the  unique  EM2  blueprint  satisfying  (I)-(IIl) 
if  there  is  one,  and  use  the  accepted  solecism 

O'  exists 

for  r there  is  an  EM2  blueprint  satisfying  (I)-(III)1 . Through  a recursive  arithmeti- 
zation  of  £e(A)**,  0^  is  regarded  as  a subset  of  co.  The  following  summarizing 
theorem  gives  the  main  features  (with  one  direction  of  its  (a)  given  by  the  proof 
of  21.9): 

21.15  Theorem  (Solovay). 

(a)  O'  exists  iff  there  is  a k -model  for  some  ordinal  k that  has  an  uncountable 
set  of  indiscernibles  whose  minimum  element  is  greater  than  k.  Hence,  if  there  is 
a k -model  for  some  k and  a Ramsey  cardinal  greater  than  k ( e.g . if  there  are  two 
measurable  cardinals),  then  0^  exists. 

(b)  O'  exists  iff  for  every  uncountable  cardinal  X there  is  a X-model  and  a 

double  class  (X,  Y)  of  indiscernibles  for  it  such  that:  X C X is  closed  unbounded, 
Y C On  — (X  + 1)  is  a closed  unbounded  class,  X U {A.}  U Y contains  every 
uncountable  cardinal  and  the  Skolem  hull  of  X U Y in  the  X-model  is  again  the 
model.  ~\ 

In  this  last  situation,  since  there  are  regular  cardinals  and  there  are  strong  limit 
cardinals,  every  member  of  Y is  inaccessible  in  the  1 -model  by  indiscemibility. 
Hence,  as  Solovay  noted  early  on,  the  existence  of  0 ' implies  the  existence  of  set, 
transitive  e-models  of  measurability.  To  be  sure,  much  more  can  be  said  about 
the  size  of  the  indiscernibles,  along  the  lines  of  9.17. 

Solovay  also  established  the  direct  analogue  of  14.11  on  the  definability  of 

0t; 

21.16  Theorem  (ZF)(Solovay).  The  relation  R C w co  defined  by 

R(x)  **  0^  exists  A x e "2  A {m  | x(tn)  — 1}  = 0^ 

is  n\.  H 
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21.17  Corollary. 

(a)  O'  is  absolute  for  transitive  e- models  of  ZF  such  that  a>\  c M as  follows: 
M \=  r There  is  an  EM2  blueprint  satisfying  (I)-(III)1  iff  0^  e M , 

in  which  case  M \=  ^ 0^  is  the  unique  EM2  blueprint  satisfying  (I)-(IIl)1. 

(b)  O'  is  a A^  subset  of  a>  which  is  not  a member  of  the  k -model  for  any  k. 

Proof.  For  (b),  if  to  the  contrary  0 ' were  a member  of  the  k -model  for  some  k, 
then  by  iterating  ultrapowers  it  can  be  assumed  that  k is  a cardinal.  But  this  is  a 
contradiction  by  (a)  and  21.15(b).  H 

Having  developed  the  analogue  of  the  0#  theory  in  §9,  we  now  proceed  to 
derive  more  information  with  iterated  ultrapowers  that  sharpens  the  focus.  But 
first,  a respite  from  the  rigors:  Instead  of  yet  another  recipe,  we  offer  the  following 
chess  problem  (M.  Henneberger,  first  and  second  prize,  “Revista  de  Sah”  1928): 

White.  King  on  bl,  Rooks  on  b7  and  c7,  and  Bishop  on  b5. 

Black.  King  on  a8,  Rook  on  a3,  and  Pawn  on  f2. 

White  to  play  and  win. 

Send  complete  solutions  to  the  author  for  a small  prize. 

For  the  rest  of  this  section,  stipulate  that 

(L[Ua\,  Ua,  id/ 3)a</3eOn  is  the  iteration  of  the  least  k -model  , 

i.e.  with  k the  least  possible.  All  the  k -models  for  various  k appear  in  this  iteration 
(20.12).  The  results  quickly  follow  from  the  basic  uniqueness  property  of  21.7 
tailored  for  O'  and  k -models  and  stated  here  for  emphasis. 

21.18  Lemma.  For  any  a,  there  is  at  most  one  double  class  ( X , Y)  of  indis- 
cernibles  for  the  Ka-model  with  X C Ka  of  ordertype  a and  Y C On  — (k„  + 1) 
such  that: 

(a)  For  any  formula  (p{  iq, . . . , u„+:t)  of  £e(A),  x\  < . . . < x„  all  in  X,  and 
y\  < ...  < ys  all  in  Y, 

the  Kc-model  satisfies  (p[x\ , . . . , x„ , y\ , — , ys]  iff 

q){co c„_i,  <70,  • ■ ■ , ds- 1)  e 0T  . 

(b)  The  Skolem  hull  of  X U Y in  the  Ka-model  is  again  the  model. 

Proof.  If  ( X , Y)  and  (X,  Y)  were  two  such  classes,  then  the  order-preserving 
injection:  X U Y -*  X U Y extends  to  an  isomorphism  of  the  /fa-model  into  itself 
and  hence  must  be  the  identity.  H 

Dropping  the  superscript  T by  uniqueness,  in  the  presence  of  O'  there  are 
corresponding  classes 
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lxn  I e On}  and  F“  = {<“  |?e  On} 

for  every  limit  a as  defined  before  21.13  and  with  the  properties  ascribed  by  it. 
By  21.14,  for  every  limit  ordinal  a,  ({/,  | rj  < a},  Ya ) satisfies  21.1 8(a)(b)  for  the 
Xa-model.  In  particular,  the  {X,  Y)  of  21.18  need  not  determine  an  EM2  blueprint: 
X could  be  finite,  with  (0,  F°)  for  the  /o-m°del  being  an  example.  In  fact,  Y°  is 
the  basis  of  the  next  lemma,  one  that  is  not  surprising  in  view  of  the  uniqueness 
properties  of  0 t . 

21.19  Lemma. 

(a)  For  every  a,  (X,  Y)  = ({k,,  | ;;  < a},  ioa“Y°)  satisfies  21.18(a)(b)  for  the 
Ka-model. 

(b)  Xn  — Kt)  for  every  i],  and  ioa“Y°  — Ya  for  every  limit  ordinal  a. 

Proof  We  first  argue  that  xo  = kq.  Clearly,  kq  < xo  by  definition  of  Kq.  For  the 
converse,  we  work  in  some  generality  for  a later  inference:  Let  A be  a regular 
cardinal  such  that  — X — xx',  there  are  arbitrarily  large  such  A by  19.7(b) 
and  21.13(d).  Note  for  what  follows  that  by  21.14,  ({/,,  | ;/  < 7},  Yx)  satisfies 
21.1 8(a)(b)  for  the  A-model.  By  19.10,  if  £ is  the  proper  class  of  cardinals  greater 
than  X fixed  by  iox,  then  {[k^  \ r]  < X},  E)  is  a double  class  of  indiscernibles  for 
the  A-model.  Note  that  this  pair  satisfies  21.18(a)  for  this  model:  There  are 
infinitely  many  xfs  in  the  set  {at,;  | ;;  < A},  and  E C Yx  as  Yx  contains  every 
cardinal  greater  than  X.  Let  A f be  the  transitive  collapse  of  the  Skolem  hull  of 
{at,,  | ;;  < A}  U E in  the  A-model,  and  tt  the  collapsing  isomorphism.  Then  A f is 
again  the  A-model  since  tt  (A)  = A.  Lienee,  — x,,  for  every  r]  < A by  21.18. 
Consequently,  /ij  < Krj  for  every  77  < A since  tt  is  collapsing. 

Llaving  deduced  that  kq  = xo,  argue  as  follows  for  any  a:  Remembering 
that  (£[£/„],  e,  Ua)  is  the  /e„-model,  by  19.6  any  x e L[Ua]  is  of  form  x = 
io a(f)(.Kn, . . . , KYn)  for  some  function  / e £[£/o]  and  y\  <...<  yn  < a.  Since 
k0  — xo,  / is  definable  in  the  /co-model  from  elements  in  Y°,  and  so  ioa(f)  is 
definable  in  the  /c„-model  from  elements  in  ioff“F°.  Hence,  ({ k ^ \ ij  < a},  ioa“Y°) 
satisfies  21.18(b)  for  the  /ra-model. 

Next,  take  any  regular  cardinal  A > a such  that  K\  = X = xx-  It  follows  from 
a previous  remark  that  ({kv  \ 1 7 < A},  iox“Y°)  satisfies  21.18(a)  for  the  A-model. 
Hence,  the  elementarity  of  ia\  shows  that  ({at,7  | ?;  < a},  ioa“Y°)  satisfies  21.18(a) 
for  the  Ka -model. 

(b)  of  the  lemma  is  now  a direct  consequence  of  21.14  and  21.18.  H 

We  can  now  define  Ya  for  successor  ordinals  a by: 

Ya  = ioa“Y°  . 

These  F“’s  are  also  closed  unbounded  classes  by  the  usual  argument  from  (lib) 
and  (Illb)  of  O'.  Save  for  a small  initial  shift,  the  F“’s  coincide: 
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21.20  Exercise.  For  any  a < f: 

(a)  — iap“(Ya  fl  (P  + 1))  U (F“  — (P  + 1)),  a disjoint  union. 

(b)  If  P < min(F“),  then  Y?  = Ya. 

Hint.  Any  i e Ya  is  inaccessible  in  the  Ka  -model.  Since  the  iteration  of  inner 
models  of  measurability  beyond  the  Ka  -model  can  be  defined  in  the  model  using 
Ua,  19.7(c)  applied  there  implies  that  iap (i)  = i for  any  p < i.  H 

The  following  overall  theorem  describes  the  global  coherence  of  Ka -models 
and  their  indiscemibles. 

21.21  Theorem.  Assume  that  0^  exists.  Then  for  every  a there  is  a class  Ya  of 
ordinals  characterized  by: 

(a)  Ya  is  closed  unbounded  and  ({at,7  | q < a),  Ya)  is  a double  class  of 
indiscemibles  for  the  Ka-model. 

(b)  The  Skolem  hull  of  {k,;  | ;/  < a}  U Ya  in  the  Ka-model  is  again  the  model. 

Moreover,  for  any  a < P,  iap“Ya  — Yf  and  if  n £ is  a collapsing  isomorphism 
from  the  Skolem  hull  of  {a:);  \ q < a}  U Y^  in  the  Kp-model  into  its  transitive 
collapse,  then:  = Kn  for  q < a,  it£(jcp)  = Ka,  and  = Ya . 

Proof.  The  characterization  of  Ya  follows  from  21.18  and  the  fact  that  two  closed 
unbounded  classes  have  many  common  members. 

For  the  assertion  about  n£,  the  transitive  collapse  must  be  the  K-model  for 
some  k with  \ ij  < a},n^“Y^)  satisfying  21.18(a)(b)  for  that  k -model. 

Because  of  the  ordertype  a of  the  lower  set  of  this  pair,  the  only  possibility  by 
uniqueness  is  k = Ka,  and  the  conclusions  follow,  also  by  uniqueness.  H 

Thus,  the  existence  of  0 : leads  to  remarkable  conclusions  about  the  simple 
generation  of  inner  models  of  measurability  and  their  relation  to  each  other.  Taking 
into  account  the  absoluteness  result  21.17(a),  for  every  a the  /r„-model  is  a subclass 
of  T[0l],  and  is  moreover  definable  in  L[0^]  as  (J  M. (0^,  a,  y),  i.e.  by  taking 
the  lower  set  of  indiscemibles  of  ordertype  a and  “stretching”  the  upper  set.  The 
resulting  systems  of  indiscemibles  are  closely  interrelated  by  iterated  ultrapowers 
and  Skolem  hulls  as  described  by  21.21. 

As  expected,  there  is  a non-rigidity  characterization: 

21.22  Exercise.  The  following  are  equivalent: 

(a)  O'  exists. 

(b)  There  is  a k -model  for  some  k and  an  elementaty  embedding  of  that  model 
into  itself  with  critical  point  greater  than  k. 

Hint.  In  the  forward  direction,  for  any  a let  Ya  be  the  closed  unbounded  class 
for  the  Ka -model  as  given  by  0^  and  h any  order-preserving  injection  of  {k^  \ r)  < 
ff}Ul™  into  itself  such  that  h(K,j)  = Kn  for  every  ;/  < a and  h(i)  > i for  some 
i e Ya . Then  h induces  an  elementary  embedding  of  the  v„-modcl  into  itself  as 
desired. 
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The  converse  can  be  established  by  following  the  arguments  for  21.1: 
Suppose  that  for  some  a,  k is  an  elementary  embedding  of  the  /Ca-model 
( L[Ua\ , e,  Ua ) into  itself  with  crit(A:)  = p > Ka.  k determines  an  L[Ua]  -ultrafilter 

W = {X  e V{p)  n L[Ua\  | p e k{X)} 

over  p.  It  now  suffices  to  show: 

(*)  W is  an  iterable  Cft/cJ-ultrafilter  . 

The  argument  could  then  be  completed  by  iterating  with  W to  get  uncountably 
many  indiscemibles  for  L[Ua ] by  19.10,  and  then  proceeding  to  get  by  the 
argument  for  21.9  and  the  ensuing  analysis. 

To  establish  (*),  note  that  the  ultrapower  of  L[Ua]  by  W is  well-founded 
and  check  that  the  arguments  for  21.2  and  21.3  can  be  generalized  from  L using 
the  following  basic  observation  deriving  from  3.3(b):  If  M and  N are  transitive, 
j:  M < N,  and  _/'(£)  = £ for  f < Ka,  then  M — L[Ua]  iff  N = L[Ua].  The 
forward  direction  is  for  ultrapower  arguments,  and  the  latter,  for  Skolem  hull 
arguments.  H 

There  are  further  equivalences  as  for  2 1 . 1 ; one  of  them  figures  in  the  following 
analogue  of  2 1 .4,  which  has  a simple  argument  if  we  avail  ourselves  of  the  result 
that  if  p is  regular,  then  p+  is  not  Jonsson  (8.17). 

21.23  Exercise.  Suppose  that  there  is  a k -model  for  some  k,  and  a Jonsson  cardinal 
X > k.  Suppose  also  that  if\tc\  is  singular,  then  X > k+ . Then  0^  exists. 

Hint.  Building  on  the  outline  for  20.22,  note  that  the  current  hypotheses  together 
with  the  result  that  successors  of  regular  cardinals  are  not  Jonsson  are  enough 
to  show  without  assuming  V = L[U\  that  there  is  a regular  v,  k < v < X, 
such  that  X — > <„•  Hence,  the  embedding  i can  be  defined  as  before, 

and  then  a corresponding  L[LT\ -ultrafilter  W over  crit(i)  > k such  that  the  ul- 
trapower of  L[U]  by  W is  well-founded  by  the  argument  for  21.1(b)  — > (c). 

H 


With  more  sophisticated  techniques,  the  conditions  on  singular  |/c|  can  be 
eliminated  (Donder-Koepke  [83:248]).  The  Jonsson  cardinal  can  also  be  replaced 
by  a suitable  analogue  of  Chang’s  Conjecture  for  uncountable  languages  to  extend 
the  analogy  to  21.4.  Also,  21.5  has  a direct  analogue  in  the  present  context. 
Finally,  the  argument  for  21.6  can  be  adapted  to  show  that  every  elementary 
embedding  as  for  21.22  is  induced  by  an  h as  described  in  its  proof. 

The  existence  of  0^  does  not  imply  the  existence  of  measurable  cardinals, 
only  inner  models  of  measurability.  The  following  is  a slight  reformulation  of  an 
observation  of  Kunen  in  the  presence  of  a measurable  cardinal. 

21.24  Proposition  (Kunen  [70]).  Suppose  that  k is  a measurable  cardinal  and 
(L\U],  e,  U)  the  K-model.  Then  the  following  are  equivalent: 
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(a)  O'  exists. 

(b)  k++l[u]  < 2K. 

Proof.  The  forward  direction  is  clear;  every  (real)  cardinal  greater  than  k is  in 
the  class  YK  of  indiscemibles  for  the  k -model  given  by  0^ , and  hence  large  in  the 
model  by  simple  indiscemibility  arguments. 

For  the  converse,  first  note  that  in  the  continuing  terminology,  k = Ka  and 
U — Ua  for  some  a.  Since  the  iteration  of  {L[U],  e,  U)  can  be  defined  in  L[U] 
using  U,  19.7(a)  applied  there  implies  that  Ka+<0  < ^++L1£/].  Also,  if  F is  the 
filter  over  Ka+lo  generated  by  {*:„+„  | n e a>},  i.e. 

X e F iff  3m  e a){Ka+„  \ m < n e &>}  c X , 
then  by  19.5  L[F]  is  the  Ka+W -model. 

Suppose  now  that  W is  any  /(--complete  ultrafilter  over  k and  jw-  V -<  Mw  = 
Ult(y,  W).  Then  by  assumption 


Ka+(0  < k++l[u]  < 2K  < jw  (k ) . 

Also  {Ka+n  | n e a>}  e Mw  since  ® Mw  c Mw,  and  hence  F D Mw  e Mw  so  that 
L[F]  — L[F  fl  Mw]  is  definable  in  Mw.  Consequently, 

Mw  N There  is  a p -model  for  some  p < jw(K)  , 

so  that  by  elementarily  there  is  a p-model  for  some  p < k.  By  21.9  this  entails 
the  existence  of  0 L H 

As  Kunen  observed,  it  follows  that  if  there  is  a measurable  cardinal  k such 
that  k+  < 2K , then  0^  exists.  This  was  the  first  inkling  of  a genuine  impediment 
to  forcing  at  measurable  cardinals:  Beyond  the  restriction  from  below  of  5.17, 
the  measurability  of  k imposes  sufficient  constraints  on  V(k)  so  that  achieving 
k+  < 2K  requires  stronger  hypotheses  and  presumably  a new  forcing  approach. 
Such  an  approach  was  to  be  discovered  by  Silver  (see  volume  II). 

Kunen  soon  elaborated  his  argument  to  draw  a stronger  conclusion  from 
several  hypotheses: 

21.25  Theorem  (Kunen  [71a]  ).  Suppose  that  for  some  k at  least  one  of  the  fol- 
lowing holds: 

(a)  k is  measurable  and  k+  < 2K . 

(b)  Every’  k -complete filter  over  k can  be  extended  to  a K-complete  ultrafilter 
over  k (e.g.  k is  strongly  compact). 

(c)  There  is  a K-complete  ultrafilter  D over  k+  which  is  uniform,  i.e.  |X|  = k+ 
whenever  X e D. 


Then  for  any  X there  is  an  inner  model  of  ZFC  with  X measurable  cardinals. 
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The  building  store  of  intuitions  indicated  that  once  a conclusion  like  this  about 
a large  cardinal  concept  can  be  deduced,  presumably  a genuine  transcendence  over 
that  concept  can  be  achieved.  This  was  indeed  confirmed  by  later  developments  in 
inner  model  theory,  with  an  actual  equiconsistency  result  established  for  2 1 .25(a) 
(see  volume  II). 

Daggers 

As  for  0#,  the  formulation  of  0^  can  be  relativized  in  a straightforward  way. 
Suppose  that  a is  a set  of  ordinals  with  a = Ua.  An  EM2  blueprint  for  a is  the 
theory  of  some  structure 


{L(\U],  U ) Xk,  yk,  A,  £} k£(o,%<a 

where  £ is  a limit  ordinal;  for  some  ordinal  k > a,  [L^[U],  €,  U,  a,  £)j<„  |=  U 
is  a normal  ultrafilter  over  k;  and  ({xk  \ k e to),  {yk  \ k e u>})  is  a double  set  of 
ordinal  indiscemibles  for  {L^[U\,  e,  U,  a,  £)f<„  such  that  for  every  k e co, 

xk  < xk+i  < k < yk  < yk+ 1 . 

Let  L[A,  B]  be  the  expected  inner  model  constructible  relative  to  A and  B (the 
least  inner  model  M such  that  for  x e M,  A fi  x e M and  Bfl.i  e M)  defined 
through  a hierarchy  as  for  L[A]  (§3).  The  theory  of  inner  models  of  measurabil- 
ity developed  in  §19  can  be  adapted  in  a straightforward  way  to  iterate  models 
(L[U,  a],  €,  U,  a , %)$<a  that  satisfy  W is  a normal  ultrafilter  over  f with  k > a. 
Again,  such  a model  only  depends  on  k (and  not  on  U)  and  all  such  models  are 
iterates  of  each  other.  The  theory  for  0^  carries  over  with  minor  changes,  so 
that  from  sufficient  hypotheses  we  get  a unique  EM2  blueprint  a t for  a satisfying 
(the  analogues  of)  the  conditions  (I)-(III).  a ' through  coding  can  be  regarded  as  a 
subset  of  |a|.  The  following  analogue  of  21.15(a)  is  typical: 

21.26  Theorem.  Suppose  that  a is  a set  of  ordinals  with  a — Ua.  Then  o'1  exists 
iff  there  is  an  inner  model  M with  a e M such  that  for  some  U € M,  M \=  U is 
a normal  ultrafilter  over  k > a,  and  (M,  e,  U,  a,  t;)$<a  has  an  uncountable  set  of 
indiscemibles  whose  minimum  element  is  greater  than  k. 

As  with  sharps,  the  main  interest  is  in  reals  a;  with  our  development  having 
shifted  from  a c to  to  a e wto,  the  foregoing  is  readily  adapted  to  such  a. 

The  results  of  this  section  attest  to  the  breadth  and  fecundity  of  Kunen’s  iter- 
ated ultrapowers.  Whether  they  can  be  proved  in  other  terms,  iterated  ultrapowers 
looms  large  as  the  framework  in  which  they  were  first  conceived.  The  uncovered 
uniformity  of  inner  models  and  their  embeddings  offer  compelling  evidence  for 
the  coherence  and  efficacy  of  the  concepts  of  measurability,  0#,  and  O'. 


Broadly  speaking,  this  chapter  described  the  first  wave  of  methodical  research 
on  Ulam’s  original  concept  of  measurability  using  the  techniques  that  became 
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available  in  the  expansive  period  of  the  1960’s.  Cohen’s  creation  of  forcing  led  to 
the  relative  consistency  analysis  in  terms  of  saturated  ideals,  real-valued  measur- 
ability, and  Prikry  forcing,  and  Scott’s  result  5.5  spurred  interest  in  ultrapowers 
and  their  iterations,  leading  to  the  analysis  of  definability  in  the  enriched  contexts 
involving  elementary  embeddings:  inner  models  of  measurability,  0#,  and  0^ . This 
latter  direction  of  research  was  to  be  vigorously  pursued  by  Jensen  and  Dodd  in 
their  investigation  of  covering  properties  and  the  Core  Model  in  the  mid-1970’s 
(see  volume  II).  Before  then,  the  forcing  analysis  of  large  cardinals  was  to  be 
advanced  by  crucial  consistency  results  of  Silver  and  Magidor  involving  strong 
hypotheses  (see  volume  II  also).  In  the  next  venture  forth  beyond  the  extended 
elaboration  of  measurability,  these  hypotheses  were  just  beginning  to  be  explored, 
the  most  prominent  among  them  being  supercompactness. 


Chapter  5 

Strong  Hypotheses 


This  chapter  provides  the  basic  theory  for  the  strong  hypotheses  that  generalize 
measurability.  §22  discusses  Solovay  and  Reinhardt’s  concept  of  supercompact- 
ness, a global  reflection  property,  and  the  relation  of  supercompactness  to  strong 
compactness.  §23  describes  the  stronger  hypotheses  that  evolved  from  Reinhardt’s 
proposals:  extendibility  and  a prima  facie  extension  shown  inconsistent  by  Kunen. 
§24  then  considers  hypotheses  on  the  verge  of  that  inconsistency,  and  then  span- 
ning the  expanse,  n-hugeness  and  Vopenka’s  Principle.  Pursuing  offshoots  of  the 
theory  of  supercompactness,  §25  describes  the  combinatorial  study  of  VKy,  and 
§26  provides  the  fundamentals  of  extenders  and  related  large  cardinals,  refined 
concepts  that  were  to  lead  to  major  advances  in  inner  model  theory. 
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22.  Supercompactness 

In  the  late  1960’s,  even  as  measurability  was  being  methodically  investigated 
Solovay  and  William  Reinhardt,  as  a graduate  student  at  Berkeley,  were  charting 
out  stronger  hypotheses  using  straightforward  guiding  ideas.  Taking  the  concept 
of  elementary  embedding  as  basic  they  considered  various  generalizations  of  the 
elementary  embedding  characterization  of  measurability.  These  new  hypotheses 
had  succinct  formulations  and  clear  motivations,  and  were  soon  found  to  have  a 
simple  but  elegant  basic  theory.  Prominent  features  were  that  the  hypotheses  form 
a linear  hierarchy  in  terms  of  consistency  strength,  and  that  an  outright  inconsis- 
tency result  of  Kunen  provided  a delimiting  upper  bound.  The  basic  framework 
generated  several  avenues  of  research  inspired  by  intriguing  questions  and  natural 
possibilities  for  structural  elaboration,  and  relative  consistency  results  in  terms  of 
the  new  hypotheses  soon  began  to  be  established  for  propositions  low  in  the  cumu- 
lative hierarchy.  With  mounting  extrinsic  evidence  in  the  next  two  decades,  these 
strong  hypotheses  together  with  the  hierarchy  up  through  measurability  became 
widely  accepted  as  the  measuring  rod  of  exhaustive  principles  against  which  all 
possible  consistency  strengths  can  be  gauged.  This  section  discusses  Solovay  and 
Reinhardt’s  concept  of  supercompactness,  and  §§23,24,  the  stronger  hypotheses 
arising  from  Reinhardt’s  main  proposals.  The  basic  theory  appeared  belatedly  in 
Solovay-Reinhardt-Kanamori  [78],  which  serves  as  a general  reference  for  these 
sections;  Di  Prisco-Marek  [85a]  is  also  a useful  source. 

Strong  compactness  (§4)  was  the  one  concept  stronger  than  measurability  that 
had  been  considered  up  to  this  time,  and  Solovay  and  Reinhardt  independently 
devised  supercompactness  as  a further  strengthening  - hence  the  term.  Recall  that 
if  k is  measurable,  U a k -complete  ultrafilter  over  k,  and  j:  V < M A Ult(  V,  U ), 
then  K M c M.  This,  especially  spiced  with  normality  for  U,  leads  to  reflection 
phenomena  at  k (6.5).  Generalizing,  for  k < y, 

k is  y -supercompact  iff  there  is  a j:  V < M such  that 

(a)  crit(y')  = k and  y < j (k)  , and 

(b)  yM  C M . 

k is  supercompact  iff  k is  y -supercompact  for  every  y > k . 

Thus,  if  k < y < S and  k is  8-supercompact,  then  k is  y -supercompact,  and  k is 
k -supercompact  iff  k is  measurable.  Although  j:  V < M is  formalized  through 
our  convention  from  5.1(c)  that  -<  between  inner  models  is  -<i,  these  formulations 
are  not  directly  formalizable  because  of  the  3 j ranging  over  classes.  The  de- 
veloping theory  provides  a characterization  in  terms  of  ultrapowers  that  provides 
an  a posteriori  formalization,  but  nonetheless  the  intelligibility  of  the  concepts 
comes  from  their  elementary  embedding  formulations,  (b)  is  the  crucial  closure 
assumption,  asserting  that  arbitrary  /-sequences  of  members  of  M are  again  mem- 
bers of  M and  implying  in  particular  that  Hy+  C M.  Only  cardinals  y need  be 
considered,  but  maintaining  the  general  definition  has  various  procedural  advan- 
tages. In  (a)  the  condition  y < j{K)  is  carried  over  from  how  supercompactness 
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evolved  from  strong  compactness.  Its  inclusion  in  the  definition  is  convenient 
for  the  development,  but  it  turns  out  to  be  superfluous:  For  any  j:  V -<  M with 
crit(y)  = k and  y M C M,  some  nth  iterate  j " has  these  same  properties  and 
satisfies  y < /'(at)  (23.15  (a)).  The  relationship  between  supercompactness  and 
strong  compactness  is  later  discussed  at  some  length,  but  first  we  develop  the  basic 
theory  of  supercompactness. 

Straightforward  observations  illustrate  how  closure  conditions  lead  to  reflec- 
tion properties: 

22.1  Proposition.  Suppose  that  k is  2K -supercompact.  Then  there  is  a normal 
ultrafilter  U over  k such  that 

{a  < k | a is  measurable}  e U . 

In  particular,  k is  the  Kth  measurable  cardinal. 

Proof.  Let  j : V -<  M witness  the  2* -supercompactness  of  at.  If  U is  defined  from 
j by 

XeU  iff  X c.  k a k e j(X)  , 

then  U is  a normal  ultrafilter  over  k.  But  since  2‘  M C M,  it  is  simple  to  see  that 
every  ultrafilter  over  a:  is  a member  of  M.  Hence,  k is  measurable  in  M,  so  that 
{a  < a:  | a is  measurable}  e U by  its  definition.  H 

22.2  Proposition.  Suppose  that  k is  y -supercompact  and  2"  = a+  for  every 
a < at.  Then  2 P — fi+  for  eveiy  P < y. 

Proof.  Let  j:  V -<  M witness  the  y -supercompactness  of  k.  Arguing  as  for  5.8, 
P < y < j (k ) and  elementarily  implies  that  (2^)M  — P+M . But  Y M C M implies 
that  V(P)M  = V(P),  so  that  2^  < (2^)M  and  P+M  = p+.  H 

That  supercompactness  is  a global  reflection  property  can  be  expressed  in 
terms  of  the  notion  A V,  that  formulas  are  absolute  for  A.  A well-known 
result  of  Levy  [65]  is  that  for  any  k > co,  HK  -<\  V. 

22.3  Proposition.  If  k is  supercompact,  then  VK  -< 2 V. 

Proof.  Suppose  that  is  £2,  say  3vof(vo,  V\)  where  f is  /7i  for  simplicity,  and 
x e VK.  If  VK  \=  (p[x],  then  for  some  y e VK,  VK  |=  f[y,  x].  But  k is  inaccessible 
so  that  VK  — Hk,  and  the  aforementioned  result  of  Levy’s  implies  that  1 j/[y,  x]. 
Conversely,  suppose  that  <p[x].  Let  z be  such  that  f[z,x].  Taking  a > 
rank(z),  the  \Va\ -supercompactness  of  k implies  that  there  is  a j:  V < M with 
crit(y)  = a c and  z e (VpK))M.  Note  that  j(x)  = x e VK  — (VK)M . Since  f is 
TI\,  it  is  readily  seen  that  {Vj(K)  \=  (p[x'\)M . It  now  follows  by  elementarily  that 
VK  \=  <p[. v]  as  desired.  H 

The  preceding  three  propositions  already  follow  from  a substantially  weaker 
hypothesis  on  k\  see  26.6. 


300 


Chapter  5.  Strong  Hypotheses 


Generalizing  the  situation  with  measurability,  y -supercompactness  is  next 
characterized  in  terms  of  ultrafilters  and  ultrapowers.  This  will  provide  an  ade- 
quate formalization  of  the  concept,  and  hence  of  supercompactness.  A general 
proposition  about  closure  properties  of  ultrapowers  related  to  5.7  (noting  that 
j“K  — k there)  suggests  and  frames  the  later  work. 

22.4  Proposition.  Suppose  that  U is  an  u>\-complete  ultrafilter  over  a set  S,  and 
j:  V < M = Ult(V,  U).  Then: 

(a)  If  j“X  e M for  some  set  X and  Y C M is  such  that  |F|  < |A|,  then 
Y e M. 

(b)  For  any  y,  j“y  e M iff  Y M C M. 

(C)  n\s\+)  t m. 

(d)  U <f  M. 

Proof,  (a)  Argue  as  for  5.7(d):  Construe  Y as  {[/*]  | x e X}.  Let  h:  S — >■  V(X) 
be  so  that  [//]  = j“X.  Define  g : S ->  V by  stipulating  that  g(i)  is  that  function 
with  domain  h(i)  satisfying  g(i)(x)  = fx(i).  Then  [g]0'(x))  = [fx]  for  every 
x e X,  and  so  ran([g])  = Y. 

(b)  This  follows  from  (a). 

(c)  Let  [/]  e M be  arbitrary.  If  A = {i  e S \ \f(i)\  < |5|}  e U,  then 
there  is  an  a e |5|+  — |J{/(i)  | i e A],  so  that  j(a)  <f  [/].  Otherwise,  B = 
[i  e S | |/(i)|  > |5|}  e U,  in  which  case  there  is  an  injective  function  h on  B 
satisfying  h(i)  e f(i)  for  every  i e B,  so  that  [/;]  e [f]  — j“V.  In  either  case, 

I/I  ^ ./-( .v  • )• 

(d)  Assume  to  the  contrary  that  U e M.  Then  S — (J  U e M,  and  so 

V(S)  — U U {S  — X | X e U]  e M,  and  so  also  V(S  x S)  e M.  This  last 
set  contains  every  well-ordering  of  S,  and  so  it  is  straightforward  to  see  that 
s(|S,|+)  e M.  But  for  any  a < |S|+,  j (a)  is  the  ordertype  of  {[/]  | / e 5a},  and 
the  set  of  such  ordertypes  is  definable  from  U and  s(|5'|+),  i.e.  j“(|5|+)  e M. 
This  contradicts  (c).  H 

(b)  provides  a simple  way  of  gauging  the  closure  of  ultrapowers,  with  (c) 
establishing  an  upper  bound  in  this  direction  and  (d)  in  another. 

Suppose  now  that  j:  V < M witnesses  the  y -supercompactness  of  k.  Then 
j“y  e M,  and  (b)  above  suggests  (recalling  5.6  for  measurability)  that  we  try  to 
define  an  ultrafilter  Uj  by: 

A c Uj  iff  j“y  e j(X)  . 

Recall  that  VvS  = {x  c S \ |x|  < v}.  Properties  of  j and  M imply  that  |j“y|  < 
j (k)  in  M,  so  that  j“y  e (Vj(K)j(y))M.  Hence,  VKy  e Uj,  and  this  can  be  taken 
to  be  the  underlying  index  set  for  Uj.  The  following  properties  obtain: 

(i)  Uj  is  a /(--complete  ultrafilter  . 

(ii)  For  any  a e y,  {x  e VKy  \ a e x}  e Uj  . 

(iii)  For  any  (Xa  \ a < y)  eYUj,  {x  e VKy  \ x e f}aex  Xa } e Uj  . 
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By  (i)  and  (ii),  for  any  y £ VKy,  {x  e VKy  \ y c x)  e Uj.  For  (iii),  note  that  if 
j({Xa  | a < y))  = (Yp  | 0 < j(y)),  then  j“y  e f]a<y  j(Xa)  = f]pejy  Yf>- 

These  properties  motivate  the  following  definitions,  stated  for  later  purposes 
for  an  arbitrary  k,  set  S,  and  filter  F over  VKS.  If  ( Xt  \ i e S)  e sV(V(S)),  then 
its  diagonal  intersection  is  {x  £ V(S ) | x £ P)/6V  Xfi,  denoted  by  Ai€SXj. 

F is  fine  iff  F is  /c-complete,  and  for  any 
i £ S,  {x  £ Pk$  I t £ x}  £ F . 

F is  normal  iff  F is  fine,  and  for  any 

( X,  | i £ S)  £ SF,  AieSXi  £ F . 

Anticipating  a possible  confusion,  note  that  members  of  such  F consist  of  certain 
subsets  of  a set  S,  not  its  elements.  Also,  we  usually  deal  with  sets  S of  ordinals, 
in  which  case  VKS  — [5]<,c;  the  former  notation  has  nonetheless  become  standard 
in  the  subject.  Finally,  for  S an  infinite  ordinal  y there  are  no  normal  filters  over 
Vtoy  since  Aaey{x  £ Vmy  \ a + 1 £ x}  is  empty. 

With  fineness  providing  the  requisite  breadth,  this  new  concept  of  normality 
is  a generalization  of  the  old  one: 

22.5  Exercise. 

(a)  If  F is  a normal  filter  over  k,  then  {X  C Vkk  \ X fl  k £ F]  is  a normal 
filter  over  Vkk. 

(b)  If  F is  a normal  filter  over  Vkk,  then  F fl  V(k)  is  a normal  filter  over 

K. 

Hint.  For  (b)  note  that  if  Xa  — {x  £ Vkk  \ a c x}  for  a < k,  then  Aa<KXa  = 
K £ F.  H 

Recall  that  for  a filter  F over  a set  I and  X C /,  A is  /-"-stationary  iff 
X n Z 0 for  every  Z £ F. 

22.6  Exercise.  A fine  filter  F over  VKS  is  normal  iff  whenever  X is  F -stationary 
and  f is  a choice  function  on  X (i.e.  /(x)  £ x for  x £ X — {0}),  there  is  an  s £ S 
such  that  /-1({s})  is  F -stationary. 

Hint.  Argue  as  for  5.10.  H 

Having  produced  an  ultrafilter  from  an  embedding,  the  process  can  be  re- 
versed by  taking  an  ultrapower.  Suppose  that  k < y and  U is  a normal  ul- 
trafilter over  VKy.  Since  this  entails  k > a>  so  that  U is  co\ -complete,  let 
jo : V -<  Mu  = Ult(V,  U).  Let  id:  VKy  —*■  VKy  be  the  identity  map.  Then: 

(i)  [id]c/  = ju“y  and  v Mv  c Mv  . 

(ii)  crit (jo)  = k and  y < jL j(k)  . 

For  (i),  jo“y  C [id]f/  by  fineness,  and  22.6  implies  that  equality  holds.  That 
Y Mu  C Mu  then  follows  by  22.4(b).  For  (ii),  cri t(ju)  > k by  k -completeness. 
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But  also,  ot([id]j/)  < ju(K ) in  Mu  since  {x  c y | ot(.r)  < k]  e U,  and  y — 
o t(ju“Y)  = ot([id]j/). 

The  following  characterization  has  been  established: 

22.7  Theorem  (Solovay,  Reinhardt).  If  k < y , k is  y -supercompact  iff  there  is  a 

normal  ultrafilter  over  VK  y . H 

One  observation  that  can  now  be  made  is  that  supercompactness  does  not 
entail  the  existence  of  greater  large  cardinals: 

22.8  Exercise. 

(a)  If  k is  supercompact  and  X > k is  inaccessible,  then  V\  \ = k is  super- 
compact. 

(b)  If  Con(ZFC  + 3k  (k  is  supercompact)),  then 

Con(ZFC  + 3k  (k  is  supercompact  A -’EA.(A  > k a X is  inaccessible))).  H 

It  can  also  be  deduced  from  22.7  that  fi  is  y -supercompact1  is  AfF:  To  show 
that  it  is  I7fF,  note  that  it  can  be  rendered  as  3x{x  = VY+s  A <p(K,  y,  x))  where 
(p  asserts  that  there  is  a normal  ultrafilter  over  VKy . ry  — Vf  is  77fF  (0.2),  and 
( p can  be  taken  to  be  Zb  with  all  quantifiers  bounded  to  x = Vy+s.  To  show  that 
it  is  /7^F,  note  that  it  can  also  be  rendered  as  Vx (x  = Vy+s  — >■  c p(K , y,  x))  with  <p 
as  before. 

This  has  a useful  consequence  about  the  persistence  of  y -supercompactness: 

22.9  Exercise.  Suppose  that  k < X,  k is  y -supercompact  for  every > y with  k < 
y < X,  and  X is  supercompact.  Then  k is  supercompact. 

Hint.  Apply  22.3,  noting  that  Vk  j=  at  is  supercompact. 

Alternately,  for  S > X glue  together  a normal  ultrafilter  over  VKS  as  follows: 
Let  U be  a normal  ultrafilter  over  Vfi,  and  for  each  x e with  |x|  > k let  Dx 
be  a normal  ultrafilter  over  VKx.  Define  W by: 

X e W iff  X c Vk8  a {x  eVx.8  \\x\>k  a A n VKx  e Dx)  e U . 

Then  IT  is  a normal  ultrafilter  over  VK8.  H 

This  shows  that  22.3  is  optimal  in  the  sense  that  the  sentence  rthere  is  a 
supercompact  cardinal1  does  not  hold  in  VK  if  k is  the  least  supercompact  cardinal. 
Supercompactness  is  characterizable  directly  as  a reflection  property: 

22.10  Theorem  (Magidor  [71a]).  k is  supercompact  iff  for  every  rj  > k there  is 
an  a < k and  an  e : Va  < Vn  with  a critical  point  8 so  that  e(8)  — k. 

Proof.  For  the  forward  direction,  suppose  that  77  > at  . Let  j : V -<  M witness 
the  | Vrj | -supercompactness  of  k.  Set  j = j \Vv\  it  is  simple  to  see  that  j : V ^ -< 
(Vm)M.  By  the  closure  of  M under  | Vn\ -sequences,  Vj  = (VfiM  e M for  t;  < ;; 
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by  induction,  and  so  also  j e M and  M [=  Vq  -<  Vj(q^ . Hence,  M |=  r there 
is  an  a < jin)  and  an  e\  Va  -<  Vj (,)  with  a critical  point  <5  so  that  e(8 ) = j(icf . 
The  desired  result  now  follows  from  the  elementarily  of  /. 

For  the  converse,  suppose  that  y > k.  Let  a < k and  e:  Va  -<  VY+a>  with 
a critical  point  8 so  that  e(8)  — k.  It  follows  that  a = /3  + a>  for  a satisfying 
e(fi)  — y.  Noting  that  V( PsP)  c Va  and  e“/3  e VKy,  define  U by: 

X eU  iff  X<ZTSP  a e“/3  e e(X)  . 

Then  U is  a normal  ultrafilter  over  V$p.  But  U e Va,  and  so  e(U)  is  a normal 
ultrafilter  over  Ve(8)e(P)  — VKy.  H 

Carrying  this  further  Magidor  [71a]  characterized  the  least  supercompact  car- 
dinal in  terms  of  a second-order  Lowenheim-Skolem  property.  See  also  Kunen 
[03],  especially  its  2.9. 

Normal  Ultrafilters 

Having  characterized  y -supercompactness  in  terms  of  the  existence  of  normal 
ultrafilters  over  VKy  and  derived  some  initial  consequences  of  this,  the  basic 
theory  of  these  ultrafilters  is  now  developed  as  in  Solovay-Reinhardt-Kanamori 
[78]. 

The  first  proposition  provides  more  information  about  ultrapowers. 

22.11  Proposition.  Suppose  that  k < y,  U is  a normal  ultrafilter  over  Vk  y , 
j:  V -<  M = Ult(U,  U ),  and  \y\—X.  Then: 

(a)  For  a < y,  j“a  — [(x  fl  a x € VKy)]u  and 
a = [(ot(x  fl  a)  | x e VKy)]u- 

(h)  X<“M  c M. 

(c)  2X<‘  < ( 2x<‘)m  < j(K)  < (2X<“)+. 

Proof,  (a)  By  normality  j“y  — [id]c/  where  id:  VKy  — > VKy  is  the  identity  map. 
Now  note  that  for  a < y,  j“a  — j“y  fl  j(a)  and  a — ot ij“a). 

(b)  By  22.4(a),  it  suffices  to  show  that  j“VKy  e M.  But 

j“VKy  = VKj“y  = (VKj“y)M  e M ; 

the  first  equality  holds  as  j (x)  = j“x  for  any  x e VKy  by  k -completeness,  and 
the  second,  as  j“y  e M and  YM  c M. 

(c)  2X<‘  < (2x<‘)m  since  (V(VK),))M  = V(VKk)  by  (b)  and  M c V . 

(2x<fiM  < j(K ) since  j(i<)  is  inaccessible  in  M.  Finally  j{k)  — {[f\u  I /:  VKy  -» 
k ) so  that  j(.K)  < (2X<‘  )+.  H 

(b)  asserts  that  in  certain  situations  there  is  more  closure,  for  example  when 
cf(k)  < K. 

We  next  take  a closer  look,  when  a normal  ultrafilter  U is  defined  from  an 
embedding,  at  the  relationship  between  that  embedding  and  ju : Suppose  that 
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j:  V < M witnesses  the  y -supercompactness  of  k,  and  U is  the  normal  ultrafilter 
over  VKy  defined  from  j with  corresponding  jv:  V -<  Mv  = Ult(V,  U).  Define 
k:  My  ->  M by: 

k([fh)  = j 

22.12  Lemma.  Set  \y\  = a.  Then: 

(a)  k is  elementary,  and  j = k o jv. 

(b)  k(a)  — a for  eveiy  a < (2k  )+My , and  k(z)  = z for  eveiy 
z e P(V{VKy))  n Mv. 

(c)  If(2k<‘)+  = ( 2k<‘)+M , then  crit(ifc)  = (2X<K)+Mv. 

Proof,  (a)  This  is  straightforward  (5.13(b)). 

(b)  For  a < y,  by  22.1 1(a) 

k(a)  — A'([(ot(3T  fl  a)  \ x € VKy)]u)  — ot (j“y  fl  j(a))  — a . 

Noting  that  V(y)  c Mv,  this  implies  that  k(x)  — x for  every  x e V(y).  (In 
one  direction,  if  a e k(x),  then  k(x)  C k(y)  = y implies  that  k(a ) = a,  so 
that  k(a)  e k(x)  and  consequently  a ex.)  Noting  that  V(VKy)  c Mu  by 
22.11(b)  this  in  turn  implies  that  k{y)  = y for  every  y e V(VKy).  (In  one 
direction,  if  x e k{y),  then  k(y)  e V(7 ->k(K)k(y))M  — V(VKy)  c Mv  implies  that 
k(x)  — x by  the  previous  assertion,  so  that  k(x)  e k(y)  and  consequently  x e y.) 
The  argument  can  be  repeated  one  more  time  to  show  that  k(z)  — z for  every 
zeV(V(VKk))n  Mv. 

That  k(a)  — a for  a < (2k<k)+Mu  also  follows,  by  induction  on  a:  Assume 
that  k(a)  — a for  a < f and  fl  < (2k<'‘  )+Mu . Then  there  is  an  z e V(V(VKX))  fl 
Mv  and  a bijection  f'.z—*i 0 such  that  / e Mv.  By  elementarily,  M ^ 
k(f):  k(z ) ->  k(f)  is  a bijection.  By  the  previous  paragraph,  k(z.)  — z.  Moreover, 
for  any  y e z, 

k(f)(y)  = k(f)(k(y))  = k(f(y))  = f(y)  ; 

the  first  equality  holds  by  the  previous  paragraph,  the  second  by  elementarily,  and 
the  third  by  induction.  Hence,  k(f)  — f,  and  so  k(/3)  = f>. 

(c)  In  view  of  (b),  it  remains  to  observe  that 

qT)+Mv  < (2^)+  = (ff<K)+M  = k(( 2x<‘)+m")  . 

The  inequality  follows  from  22.11(c),  and  the  last  equality,  from  elementarily. 

H 


Suppose  now  that  j is  itself  a V < Mq  = Ult(F,  U)  where  U is  a normal 

ultrafilter  over  V^S  with  S > y.  Then 

X e U iff  X c PKS  a j0“8  e j0(X)  , 


and 


YeU  iff  Y<zVKy  a j0“y  e j0(Y)  . 
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Setting 


U\y  = {{xfly  | x e X}  \ X e U]  , 


it  readily  follows  that 

U = U\y. 

U\y  — MU)  in  a former  terminology,  where  /*([/)  = {Y  \ f~x(Y)  e U]  and 
/:  VKi 5 —>  VKy  is  given  by  fix)  = x fl  y.  The  corresponding  k\  Mv  -<  Mq  can 
be  explicated  by 


*([</(*)  I ^ e VKy)]u)  = [(fix  n y)  \x  e PK«>]p  . 

Note  that  for  S > 2X<‘ , sMjj  C Mq  implies  by  22.12(c)  that  crit(A:)  = (2X<*  )+Mu . 

We  next  consider  the  question  of  the  number  of  normal  ultrafilters,  first  raised 
for  measurable  cardinals.  If  k is  measurable,  then  by  20.11  there  is  an  inner 
model  in  which  there  is  only  one  normal  ultrafilter  over  k,  and  by  17.8  there  is 
a forcing  extension  in  which  there  is  the  maximal  possible  number  22‘  of  normal 
ultrafilters  over  k.  ff  at  is  2K -supercompact,  then  by  5.16  and  22.1  there  are  at 
least  two  normal  ultrafilters  over  k,  one  containing  [a  < k \ a is  measurable} 
and  the  other  not.  In  1967  Solovay  and  also  Kanji  Namba  showed  that  there 
are  (2*)+  normal  ultrafilters.  A few  years  later,  Solovay  showed  that  in  fact 
there  are  2?K  normal  ultrafilters  over  k.  The  question  of  whether  the  number  of 
normal  ultrafilters  over  some  k can  be  intermediate  between  1 and  22*  was  settled 
by  Mitchell  [74].  In  the  first  development  in  inner  model  theory  for  hypotheses 
stronger  than  measurability,  he  showed  that  assuming  sufficient  hypotheses  about 
k (2K -supercompactness  suffices),  if  r < k or  is  one  of  the  terms  k+  or  k++, 
there  is  an  inner  model  in  which  there  are  exactly  r normal  ultrafilters  over  k. 
In  the  model  with  exactly  two  normal  ultrafilters  over  a k,  one  contained  the  set 
[a  < k | a is  measurable}  and  the  other  did  not.  Stewart  Baldwin  [85]  built  on 
Mitchell’s  work  to  show  that  assuming  similarly  strong  hypotheses  about  k,  if 
t < k,  there  is  an  inner  model  in  which  there  are  exactly  r normal  ultrafilters 
over  k and  k is  the  only  measurable  cardinal.  It  is  not  known  how  to  get  such 
relative  consistency  results  assuming  only  that  k is  measurable  at  the  outset. 

Solovay’ s result  mentioned  above  is  now  established  in  a general  form;  it  is 
a consequence  of  a structural  result  that  has  an  argument  having  aspects  of  self- 
reference as  well  as  reflection.  Again,  for  an  a>\ -complete  ultrafilter  W,  jw:  V -< 
Mw  = Ult(V,  W). 


22.13  Theorem  (Solovay  ).  If  k < y and  k is  2 1 K 1 * -supercompact,  then  for  any 
A e V(V(VKy))  there  is  a normal  ultrafilter  W over  VKy  such  that  A e Mw- 

Proof.  Let  U be  a normal  ultrafilter  over  Vk(2^y^<k)  and  j:  V ~<  M = Ult(V,  U). 
Let  (p( vo,  Vi,  V2)  be  a formula  such  that  <p[A,/c,  y]  iff  A e V(V(VKy))  but  for 
any  normal  ultrafilter  W over  VKy,  A £ Mw.  <p  can  be  assumed  to  have  the 
property  that  3i>o <p[k,  y]  exactly  when  M |=  3r>o <p[k,  y]:  Since  V(V(VKy))  c M, 
for  any  normal  ultrafilter  W over  VKy,  W e M.  Moreover,  V(V(VKy))  fl  Mw  is 
definable  in  M since  every  function:  VKy  —*■  VK  is  in  M,  and 
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V(V(VKy))  n Mw  c (Vy+4)m»  c (V*w)""  = yV(^)  • 

Assume  now  to  the  contrary  that  3t>o (p[K,  y].  Then  M \=  y].  Mindful 

of  22.12  and  the  discussion  following  it,  let  U\y  be  the  normal  ultrafilter  over 
VKy  corresponding  to  U with  j0:  V -<  M0  = Ult(V,  U\y)  and  k:  M0  -<  M such 
that  j = k o jo.  Since  k(s:)  = k and  k(y)  — y,  by  elementarity  of  k there  is 
an  Ao  e Mq  such  that  Mo  |=  <p[Ao,  k,  y].  Since  k(Ao)  = Aq,  it  follows  that 
M 1=  cp[Ao.  k,  y],  and  hence  (p[Ao,  k,  y].  But  Ao  e Mo  = Mv \y.  Contradiction! 


22.14  Corollary.  If  k < y and  k is  2^<K  -supercompact,  then  there  are  22'1 
normal  ultrafilters  over  VKy.  Hence,  if  k is  2K -supercompact,  then  there  are  2?" 
normal  ultrafilters  over  k. 

Proof  Set  \y\  = X.  Note  first  that  if  IT  is  a normal  ultrafilter  over  VKy,  then 

\V{V{VKX))  n Mw | < (22"<K)Mw  < jw(K)  < (2X<')+ 

by  the  argument  for  22.11(c).  By  22.13, 

V{V(VKy))  — \J\V(V(VKy))  fl  Mw  | IT  is  a normal  ultrafilter  over  VKy)  . 

Hence,  it  follows  that  there  must  be  \VfP(VicY))\  — 22‘  normal  ultrafilters  over 
VKy. 

The  second  assertion  follows  from  22.5(b),  noting  that  k<k  = k.  H 

Historically,  Magidor  [71]  had  first  shown  that  if  k is  supercompact,  then 
there  are  (2^1  )+  normal  ultrafilters  over  VKy.  A further  corollary  provides 
another  measure  of  the  wealth  of  normal  ultrafilters: 

22.15  Exercise.  If  k < y and  k is  2^  * -supercompact,  then 

(2,yl  *)+  = sup({  jvy (v)  | IT  is  a normal  ultrafilter  over  VKy})  . 

Hint.  One  direction  follows  from  22.11(c).  For  the  converse,  suppose  that  j < 
(2lxC')+.  Let  A e V(V(VKy))  be  such  that  % < (2lyl<lt)+i[Al,  and  by  22.13,  IT  a 
normal  ultrafilter  over  VKy  such  that  A e Mw-  Show  that  £ < jw(j<)-  H 

The  proof  of  22.14  shows  that  if  k is  2K -supercompact,  then  there  are  2?" 
normal  ultrafilters  over  k containing  the  set  {a  < k \ a is  measurable}.  But 
paradoxically,  the  following  question  remains  open: 

22.16  Question.  If  k is  2K  -supercompact,  does  it  follow  that  there  is  more  than 
one  normal  ultrafilter  over  k containing  the  set  {a  < k \ a is  not  measurable}? 

The  theory  of  fine  and  normal  ultrafilters  over  VKy  has  been  pursued  along 
these  lines  by  Julius  Barbanel  [82,  82a,  85]  and  Yoshihiro  Abe  [84].  The  main 
avenues  of  related  research  concern  combinatorial  aspects  of  VKy  (§25). 
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On  Strong  Compactness 

Having  developed  the  basic  theory  of  supercompactness,  one  based  on  simple 
reflection  arguments,  we  now  turn  to  a contextual  characterization  of  strong  com- 
pactness and  discuss  its  relationship  to  supercompactness,  continuing  to  follow 
Solovay-Reinhardt-Kanamori  [78]. 

In  the  study  of  strong  compactness  fine  ultrafilters  over  VK  y come  naturally 
into  play  as  in  the  proof  of  4.1.  Define  for  to  < k < y: 

k is  y -compact  iff  there  is  a fine  ultrafilter  over  VKy  . 

Hence,  if  k is  y -supercompact,  then  k is  y -compact;  and  k is  k -compact  iff  k is 
measurable.  Also,  if  k < y < S and  k is  5-compact  with  U a fine  ultrafilter  over 
Vk8,  then  U\y  is  a fine  ultrafilter  over  VKy,  and  thus  k is  y-compact.  Fineness 
without  normality  has  the  following  characterizations.  Recall  that  for  a set  S and 
E c V(S),  the  filter  over  S generated  by  E is  {X  c S | 3T  e E(Y  c X)} 
provided  that  it  is  proper  and  non-principal. 

22.17  Theorem.  If  k < y,  the  following  are  equivalent: 

(a)  k is  y -compact. 

(b)  There  is  a j:  V -<  M with  crit(y)  = k such  that:  for  any  X C M with 

|X|  < y,  there  is  a Y e M such  that  F3X  and  M \=  \Y\  < j(K). 

(c)  For  any  set  S,  eve?y  k -complete  filter  over  S generated  by  at  most  \y\ 
sets  can  be  extended  to  a K-complete  ultrafilter  over  S. 

Proof,  (a)  — > (b)  Let  U be  a fine  ultrafilter  over  VKy  and  j:  V < M = Ult(y,  U). 
Suppose  that  X c M with  |X|  < y,  say  X — {[fa]u  \ a < y}.  Define  F:  VKy  -> 
V by  F(x)  — { fa(x ) | a e x},  and  set  Y — [flu-  Then  F3X  by  fineness,  and 
M \=  \Y\  < j (k ) . Finally,  note  that  k -completeness  and  the  preceeding  argument 
with  X — k shows  that  crit(y)  = k. 

(b)  — >■  (c)  Suppose  that  F is  a /c-complete  filter  over  a set  S generated  by  a 
E C V{S)  with  \E\  < y.  By  (b)  let  Y 3 j“E  with  Y € M and  M \=  \Y\  < j(K). 
In  M , j ( F)  is  j(K) -complete  and  j(F)  n Y is  a subset  of  cardinality  less  than 
j(i<).  Hence,  there  is  a c e M so  that  c e f](j(F)  n F).  c generates  an  ultrafilter 
C7  by:  X e U iff  X c.  S Ace  j (X).  It  is  simple  to  check  that  U is  a k -complete 
ultrafilter  over  S extending  F . 

(c)  — > (a)  Extend  the  k -complete  filter  over  VKy  generated  by  the  family 

{{x  eVKy  | a e x)  \ a < y}  to  a ^-complete  ultrafilter  over  VKy.  H 

22.18  Corollary.  k is  strongly  compact  iff  k is  y -compact  for  every  y > k. 

Proof.  By  4.1.  H 

We  now  see  how  the  concept  that  came  after  is  the  mightier:  Solovay  and 
Reinhardt  independently  devised  supercompactness  to  ensure  Y = X in  22.17(b) 
through  the  normality  condition.  This  led  to  the  simple  transcendence  as  given 
by  22.1,  and  provided  a provisional  answer  to  a question  of  Tarski:  Is  the  least 
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strongly  compact  cardinal  necessarily  greater  than  the  least  measurable  cardinal? 
The  Vopenka-Hrbacek  result  5.8  together  with  the  existence  of  inner  models  for 
measurability  had  shown  that  measurable  cardinals  need  not  be  strongly  compact 
and  Kunen’s  result  21.25,  that  strong  compactness  is  strictly  stronger  than  measur- 
ability in  consistency  strength.  Yet,  Tarski’s  question  had  remained  unanswered. 

Having  made  the  conceptual  move  to  supercompactness,  the  problem  was 
shifted.  While  the  process  of  producing  a normal  ultrafilter  over  a measurable 
cardinal  k was  a simple  one,  trying  to  produce  a normal  ultrafilter  over  VK  y from 
a fine  one  met  with  unexpected  difficulties.  Still,  with  so  little  experience  strong 
compactness  and  supercompactness  had  the  same  feel,  and  Solovay  conjectured 
that  they  are  equivalent.  This  conjecture  was  to  be  refuted,  but  it  inspired  much 
of  the  early  work  in  this  area.  In  a few  years,  with  mounting  extrinsic  evidence 
provided  by  relative  consistency  results  supercompactness  came  to  be  accepted 
as  the  proper  generalization  of  measurability  in  the  emerging  hierarchy  of  large 
cardinals. 

Strong  compactness  was  nonetheless  to  remain  a significant  hypothesis,  par- 
ticularly because  of  its  combinatorial  consequences.  Jussi  Ketonen  [72]  clarified 
the  concept  with  another  characterization:  If  k < X are  both  regular,  then  k is  X- 
compact  iff  for  every  regular  jx  such  that  k < fx  < X there  is  a uniform  K-complete 
ultrafilter  over  /x.  This  confirmed  a conjecture  of  Kunen,  and  reduced  strong  com- 
pactness to  having  uniform  /r-complete  ultrafilters  over  arbitrarily  large  cardinals. 
Before  this,  Solovay  had  made  an  initial  connection  between  supercompactness 
and  the  Singular  Cardinals  Problem,  and  using  an  idea  in  Ketonen’ s proof,  he  was 
able  to  refine  that  connection  to  the  following  solution  from  strong  compactness 
(see  volume  II):  If  k is  X+ -compact  and  X is  a singular  strong  limit  cardinal  > k, 
then  2X  — X+ . 

Solovay’s  conjecture  that  strong  compactness  and  supercompactness  are 
equivalent  was  refuted  by  his  student  Telis  Menas  in  late  1972. 

22.19  Theorem  (Menas  [74]). 

(a)  If  k is  measurable  and  a limit  of  strongly  compact  cardinals,  then  k is 
strongly  compact. 

(b)  If  k is  the  least  cardinal  as  in  (a),  then  {a  < k \ a is  measurable]  is  not 
a stationary  subset  of  k.  In  particular,  k is  not  2K  -supercompact. 

Proof,  (a)  Let  U be  a ^-complete  ultrafilter  over  k with 

A — {a  < k | a is  strongly  compact]  e U . 

(Such  a U is  simple  to  produce:  If  W is  any  a: -complete  ultrafilter  over  k and 
f:K->Ais  injective,  take  U — {X  c k \ f~l(X)  e W}.)  Suppose  now  that 
y > k.  For  each  a e A let  Ua  be  a fine  ultrafilter  over  Vay,  and  define  D by: 

X e D iff  X<zVKy  a {a  e A \ X n Vay  e Ua}  e U . 

Then  it  is  simple  to  check  that  D is  a fine  ultrafilter  over  VKy . 
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(b)  Set  A — {a  < k \ a is  measurable}.  Define  / e kk  by: 

f(a ) = sup({/6  < a | (3  is  strongly  compact})  . 

Then  by  hypothesis  on  at,  / is  regressive  on  A,  yet  f cannot  be  constant  on  any 
unbounded  subset  of  k.  Hence,  A is  not  stationary.  It  follows  that  A is  not  in  any 
normal  ultrafilter  over  k , and  hence  by  22.1  that  k is  not  2K -supercompact.  H 

The  D in  the  proof  of  (a)  illustrates  how  fine  ultrafilters  can  be  composed  with 
relative  ease,  and  (b),  how  normality  with  the  consequent  reflection  properties  is 
another  kettle  of  fish.  Abe  [85]  observed  that  D is  not  normal.  23.8  will  provide 
measurable  limits  of  strongly  compact  cardinals. 

Menas’s  1973  Berkeley  thesis  substantiated  the  gap  between  fine  and  normal 
ultrafilters  in  various  ways,  and  made  contributions  to  the  emerging  study  of  VKy 
combinatorics  ([74,76]  - see  §25).  Much  of  the  work  was  devoted  to  forcing;  he 
developed  methods  for  preserving  supercompactness  [76a],  and  having  broached 
the  possibility  with  22.19,  established  the  relative  consistency  of  there  being  ex- 
actly one  strongly  compact  cardinal  and  it  failing  to  be  supercompact  ([74]). 

Menachem  Magidor,  following  upon  his  1972  Hebrew  University  thesis  on 
supercompactness  from  which  [71,71a]  have  already  been  cited,  established  the 
principal  results  on  the  varying  nature  of  strong  compactness.  A driving  force 
in  large  cardinal  theory,  Magidor  quickly  became  a major  set  theorist.  His  early 
and  unabashed  use  of  supercompactness  in  several  relative  consistency  results 
confirmed  their  significance  and  efficacy  and  steered  set  theoretic  research  upward 
to  the  investigation  of  consistency-wise  strong  propositions.  By  mid- 1972,  he  [76] 
had  established: 

If  k is  supercompact,  then  there  is  a forcing  extension  in  which  k remains 
supercompact  and  is  also  the  least  strongly  compact  cardinal  . 

Then  at  the  beginning  of  1973,  motivated  by  Menas’s  22.19  and  relative  consis- 
tency result,  Magidor  [76]  established: 

If  k is  strongly  compact,  then  there  is  a forcing  extension  in  which  k 
remains  strongly  compact  and  is  also  the  least  measurable  cardinal  . 

This  result  separated  strong  compactness  from  supercompactness  in  the  strongest 
possible  way,  and  rather  unexpectedly  answered  the  aforementioned  question  of 
Tarski  in  the  negative.  See  volume  II  for  proofs  of  Magidor’s  results. 

Several  basic  questions  about  strong  compactness  have  remain  unanswered. 
The  following  was  not  resolved  by  Magidor’s  consistency  results: 

22.20  Question.  Are  the  following  theories  equiconsistent? 

(a)  ZFC  + 3k (k  is  supercompact). 

(h)  ZFC  + 3k(k  is  strongly  compact). 
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That  there  are  many  normal  ultrafilters  over  k was  established  by  a reflection 
argument  whose  inflexibility  led  to  the  question  22.16;  are  there  other  means 
available? 

22.21  Question.  If  k is  strongly  compact,  does  it  follow  that  there  is  more  than 
one  normal  ultrafilter  over  k ? 

Finally,  the  following  question  is  related  to  22.2  and  concerns  the  possibilities 
with  forcing: 

22.22  Question  (Woodin).  If  k is  strongly  compact  and  2V  = v+  for  every’  v < k, 
then  does  GCH  hold? 

§§25,26  describe  developments  in  different  directions  related  to  the  analysis 
of  supercompactness.  The  first  pursues  the  combinatorial  study  of  (PKy,  C>  based 
on  analogies  with  (at,  <)  initially  seen  in  connection  with  normal  ultrafilters.  The 
second  develops  a basic  analysis  of  elementary  embeddings  and  corresponding 
weak  versions  of  y -supercompactness.  Unlike  for  measurability  a canonical  inner 
model  theory  for  y -supercompactness  is  not  available,  with  naive  attempts  failing, 
e.g.  if  k < X and  U is  a normal  ultrafilter  over  VKX,  then  L[U\  = L (25.8). 
The  approach  of  §26  does  lead  to  a successful  inner  model  theory  for  reasonably 
substantial  hypotheses. 

As  set  theory  advanced  in  the  1970’s  to  the  study  of  consistency-wise  strong 
combinatorial  propositions,  various  forcing  results  soon  established  supercompact- 
ness as  a hypothesis  of  sufficient  strength  for  relative  consistency.  However,  in 
addition  to  lacking  an  inner  model  theory  for  the  sharper  study  of  consistency 
strength,  supercompactness  did  not  seem  to  have  substantial  direct  consequences 
for  definable  sets  of  reals  beyond  those  of  measurability.  An  early,  optimistic 
conjecture  of  Solovay  [69:  60]  was:  if  there  is  a supercompact  cardinal,  then  ev- 
ery set  of  reals  in  L(M)  has  the  perfect  set  property.  In  remarkable  developments 
in  the  mid-1980’s,  this  and  much  more  (ADi(R)  - see  §32)  were  confirmed  and 
from  the  weaker  hypotheses  of  §26.  However,  it  was  in  the  expansive  setting  pro- 
vided by  supercompactness  that  the  crucial  techniques  were  first  developed,  with 
the  weaker  hypotheses  only  emerging  in  the  effort  to  sharpen  implications.  With 
equiconsistency  results  established  for  these  hypotheses  and  continuing  progress 
in  their  inner  model  theory,  there  is  no  doubt  left  as  to  the  intrinsic  necessity  of 
strong  hypotheses  based  on  elementary  embeddings  with  closure  conditions. 
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The  large  cardinal  hypotheses  stronger  than  supercompactness  evolved  from  the 
proposals  of  William  Reinhardt.  A recurring  theme  of  his  work  was  to  investigate 
the  concept  of  set  together  with  broad  notions  of  class  or  property.  He  considered 
various  extensions  of  ZFC  based  on  such  notions,  and  just  as  an  inaccessible 
cardinal  provides  a natural  model  of  ZFC,  strong  large  cardinal  hypotheses  were 
motivated  as  providing  natural  models  for  the  emerging  theories. 

In  his  1967  Berkeley  thesis  Reinhardt  (see  [70])  established  results  about 
Ackermann’s  set  theory  A.  With  motivations  quite  different  from  those  for  ZFC, 
A provides  for  a universe  with  extensionally  determined  entities,  classes  in  a 
broad  sense,  together  with  a predicate  for  sethood.  With  that  predicate  rendered 
as  x € V,  the  main  schema  of  A is:  If  X c V is  definable  using  only  parameters 
from  V (and  without  the  predicate  V ),  then  X e V.  Let  A*  be  A augmented  by 
the  Axiom  of  Foundation  for  members  of  V.  Levy  and  Vaught  had  observed  that 
much  as  for  ZF,  A*  is  consistent  relative  to  A,  but  also  that  in  A*  one  can  prove 
the  existence  of  classes  like  {V},  V(V),  and  V(V(V)).  On  the  other  hand,  Levy 
had  shown  that  if  the  relativization  to  V of  a sentence  a of  Ce  is  provable  in 
A*,  then  a is  provable  in  ZF.  Reinhardt  then  established  the  converse,  so  that  in 
particular  A*  and  ZF  are  equiconsistent.  He  also  showed  that  ii  Q < -q  < a < f 
and 

(*)  (Va+ti,  e,  x)x€Va  = {Vp+ri,  e,  x)x€v„ 

(where  = denotes  elementary  equivalence),  then  (Vp+n,  e,  Va)  is  a model  of  A*, 
with  Va  interpreting  the  predicate  V for  sethood.  Levy  had  established  this  result 
for  (7=1;  Reinhardt  observed  that  the  existence  of  0 < ;/  < a < satisfying  (*) 
follows  from  a sufficient  indescribability  assumption  on  /S  of  the  sort  discussed  in 
§6  for  the  motivating  case  q e a>. 

At  the  end  of  his  thesis  Reinhardt  considered  an  axiom  that  was  soon  proved 
inconsistent  by  Kunen,  a dramatic  turn  of  events  that  will  be  discussed  later  in 
this  section.  That  axiom  was  a prima  facie  extension  of  a proposal  for  extending 
Ackermann’s  set  theory,  a proposal  which  can  be  cast  within  set  theory  as  follows: 
Extend  (*)  above  in  the  motivating  case  q — 1 by  incorporating  for  each  “class” 
X C.Va  a corresponding  interpretation  j (A)  C Vp  to  get 


(Va+i,  e,  X,  X)x€V„,XQVa  = {Vp+l,G,X,  j(X))xeVa,XQV„  ■ 

It  is  simple  to  see  that  this  amounts  to  asserting  that  j : Va+\  < Vp+\  with  crit(y')  = 
a.  This  is  how  Silver  formulated  Reinhardt’s  idea,  and  then  Reinhardt  considered 
arbitrary  rj  > 0: 

k is  -extendible  iff  there  is  a f and  a j : VK+rl  -< 

with  crit(j)  = k and  i j <j(K)  . 
k is  extendible  iff  k is  -extendible  for  every  rj  > 0 . 
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Thus,  embeddings  are  postulated  that  are  sets,  but  whose  domains  and  target 
structures  have  the  ultimate  closure  property:  they  are  initial  segments  of  the 
universe.  Since  ;/  > k ■ co  implies  that  k + p = rj,  the  specific  form  of  the  first 
definition  is  pertinent  only  for  small  rj.  For  tj  < k it  is  clear  that  f = j (at)  + i],  so 
that,  generalizing  the  sense  from  ;/  < co,  ^-extendibility  asserts  that  (rj+  l)st-order 
properties  are  preserved  between  VK  and  Vj(K)  in  a strong  way.  The  condition 
ri  < j (k ) evolved  from  the  r)  — 1 case;  it  is  convenient  to  include  it  in  the 
definition,  but  it  is  superfluous  for  full  extendibility  (23.15(b)). 

Silver’s  1 -extendibility  is  already  quite  strong: 

23.1  Proposition.  If  k is  1-extendible,  then  k is  measurable,  and  there  is  a normal 
ultrafilter  U over  k such  that 

{a  < k | a is  measurable}  e U . 

Proof.  Let  j:  VK+\  -<  Vj(K)+ \ with  crit(j)  = k.  Since  V(k)  c Vk+\,  a normal 
ultrafilter  can  be  defined  from  j as  usual  by 

X eU  iff  X<zK  a k e j(X)  . 

But  U e V j(K)+i  so  that  Vj(K)+ 1 1=  k is  measurable,  and  consequently 
{a  < k | a is  measurable}  e U by  its  definition.  H 

We  shall  soon  establish  that  extendibility  implies  supercompactness,  but  also 
that  these  concepts  are  closely  intertwined.  However,  while  supercompactness 
was  arrived  at  rather  pragmatically  as  a strengthening  of  strong  compactness, 
Reinhardt’s  motivations  for  extendibility  were  based  on  his  analyses  of  the  concept 
of  set.  We  continue  to  chart  his  work  in  order  to  describe  these  motivations  in 
context. 

In  [74]  Reinhardt  formulated  a set  theory  S based  on  a set-existence  principle 
of  Shoenfield.  S also  provided  for  a universe  of  extensionally  determined  entities 
and  had  a predicate  V,  and  in  effect  extended  Ackermann’s  A.  For  S however, 
Reinhardt  now  viewed  V as  the  collection  of  “existing”  sets  and  the  universe  as 
consisting  of  “imaginable”  sets  - things  and  imaginings.  Shoenfield’s  principle 
was  rendered  as:  If  X C Y for  some  (imaginable)  Y and  X is  definable  using 
only  parameters  in  V,  then  X e V.  With  S*  bearing  the  analogous  relation  to  S 
that  A*  does  to  A,  Reinhardt  observed  that  the  relativization  to  V of  a sentence 
a of  C€  is  provable  in  S*  exactly  when  a is  provable  in  ZF. 

Reinhardt  then  extended  S*  to  a further  theory  S+  by  enriching  the  notion  of 
property.  He  first  introduced  a predicate  U for  the  imaginable  sets  and  a further 
predicate  P for  “existing  properties  of  sets”.  The  main  new  principle  that  he 
now  proposed  was  that  each  imaginable  set  consisting  of  existing  sets  corresponds 
to  an  existing  property  of  sets,  i.e.  for  any  w c V,  there  is  a Q e P such 
that  w — V fl  Q.  Reinhardt  gave  contextual  and  other  arguments  for  this  broad 
interpretation  of  property.  He  then  showed  that  a structure  of  form  (Vp,  D,  e,  Va), 
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where  Vp  interprets  U,  D c Vp+\  interprets  P,  and  Va  interprets  V,  is  a model 
of  S+  iff 

(a)  If  Q'  is  definable  in  {Vp,  e,  Q)q^d,  then  Q'  e D , 

(b)  D fl  Vp  — Va  , and 

(c)  Va+1  = {QnVa  | Q e D}  . 

(a)  provides  for  the  requisite  store  of  properties,  (b)  expresses  Shoenfield’s  prin- 
ciple in  the  presence  of  (a),  and  (c)  expresses  Reinhardt’s  new  principle.  He 
showed  that  these  conditions  imply  that  a is  measurable  with  a normal  ultrafil- 
ter containing  the  set  {§  < a | £ is  measurable},  and  as  is  simple  to  check,  if 
j : VK+\  < Vj(K)+ 1 witnesses  the  1 -extendibility  of  k,  then  (Vj(K),  j“VK+ i,  e,  VK) 
satisfies  the  conditions. 

[74a]  was  Reinhardt’s  most  extensive  and  explicit  statement  about  motivations 
for  large  cardinal  axioms  based  on  the  concept  of  set.  Proceeding  informally  this 
time  from  the  vantage  point  of  ZFC,  he  advanced  ideas  of  the  sort  that  he  had 
pursued  in  terms  of  more  elaborate  formalizations.  After  considering  the  role  of 
the  class  On  in  reflection  arguments  for  motivating  the  inaccessible,  Mahlo  and 
indescribable  cardinals,  Reinhardt  introduced  V and  On  as  constants  in  the  spirit 
of  an  imagined  projection  of  ideas  over  new  possibilities  like  Vbn+i  and  Von+On- 
He  argued  that  since  Cantor  through  his  doctrine  of  the  Absolute  had  intended  the 
universe  of  sets  to  comprehend  all  possibilities,  the  theory  of  the  new,  formally 
projected  universe  of  imaginary  sets,  even  allowing  parameters  from  V,  ought  to 
be  the  same: 

Vx  e V (<pv[x]  **  ^[x])  . 

Reinhardt  next  extended  these  considerations  into  class-set  theory  with  arbi- 
trary classes  X c V.  He  argued  that  a proper  class  X is  essentially  different  from 
a set  x in  that  if  there  were  more  ordinals,  then  x would  have  exactly  the  same 
members,  whereas  X would  necessarily  have  new  members.  Writing  j(X)  for  the 
extension  of  X in  the  formally  projected  universe,  he  came  to  the  schema: 

(t)  Vx  e VVX  c V((pv<v)[x,  X]  <p[x,  j(X)])  . 

This  is  the  exact  counterpart  to  the  aforementioned  extension  of  Ackerman’s  A in 
Reinhardt’s  thesis  that  led  to  Silver’s  formulation  of  1 -extendibility.  It  is  consistent 
with  the  persistent  theme  that  sets  are  sharply  delimited  and  determined  by  their 
extensions,  whereas  classes  in  the  sense  of  properties  have  an  essential  intensional 
quality. 

Towards  full  extendibility,  Reinhardt  not  only  considered  the  cumulative  hi- 
erarchy as  proceeding  beyond  V — Von  in  the  formally  projected  universe,  but 
projected  versions  of  these  further  levels.  While  the  introduction  of  new  objects 
beyond  Cantor’s  intentions  may  violate  the  universality  of  the  concept  of  set,  Rein- 
hardt proposed  ([74a:  198])  “to  mitigate  this  sorrow  by  seeing  the  universality  not 
in  the  extension  of  the  concept  of  set  but  in  the  applicability  of  the  theory  of  sets.” 
This  recalls  his  observation  in  [74:  13]  that  the  axioms  of  ZF  follow  outright,  not 
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just  relativized  to  V,  from  his  S*.  Let  A be  a formal  ordinal  greater  than  On,  with 
corresponding  V-,  . Reinhardt  argued  for  a projected  realm  VQn'  corresponding  to 
V — Von,  a Vx'  corresponding  to  Vx,  and  a correspondence  j:  Vx  — > Vx>  so  that 
j(x)  = x for  x e V,  j (On)  = On',  and  the  theory  is  preserved: 

There  is  a j : Vx  < V-,j  with  crit(j)  = On  and  j (On)  = On'  , 

and  in  analogy  with  (f)  for  X — On+ 1,  X < j (On).  This  of  course  is  the  assertion 
of  /--extend i hi  1 i ty  of  On  (at  least  when  X > On  • co). 

The  theory  of  properties  with  necessity  operator  in  [80]  is  the  most  mature  and 
sophisticated  formalization  of  Reinhardt’s  ideas.  Here,  properties  are  elevated  to 
the  status  of  rich,  intensional  entities,  and  modal  operators  are  used  to  mediate  with 
extensional  entities.  A professed  feature  is  expressive  power  adequate  to  formulate 
extendibility,  although  he  only  indicated  the  treatment  of  a simple  version. 

Extendibility  is  now  brought  into  the  fold  by  describing  its  relationships  to  the 
other  hypotheses,  especially  supercompactness.  As  for  §22,  Solovay-Reinhardt- 
Kanamori  [78]  continues  to  serve  as  a general  reference. 

A simple  lemma  leads  to  the  initial  observation. 


23.2  Lemma.  There  is  a formula  cp(  i>o,  Vi)  of  C€  such  that  for  any  a < 

Vp  |=  x]  iff  x — Va  . 

Proof.  Consider  the  formulas 

: f is  a function  A dom(/)  e On  A 
Vy  e dom(/)(/(y)  = [j {?(/(£))  | $ < y})  . 
ti-  v0  = dom (/)  A m = [J{' P(f(y))  I Y e dom(/)}  . 
fy.  v0  = dom (/)  + 1 A iq  = V(J{V( f(y))  \ y e dom(/)})  . 

Va:  vo  = dom (/)  + 2 a iq  = V(f(y))  \ y e dom (/)}))  . 

i//i  asserts  that  / must  satisfy  the  usual  recursive  definition  of  the  rank  hi- 
erarchy, so  that  f(y)  = Vy  for  y e dom(/).  (The  proof  could  alternately  have 
been  cast  in  terms  of  the  rank  function  (cf.  0.2).) 

Let  <p(yo,  Vi)  be  3 f{f\  A (f/2  v i/f3  v t//4)).  To  show  that  this  works,  suppose 
that  a < ft.  It  is  simple  to  see  that  if  |=  (p\a,  x],  then  x = Va.  For  the 
converse,  assume  that  x — Va.  It  is  straightforward  to  check  that  if  / satisfies 
i//i , then  / e Vdom(/)+3,  and  if  dom(/)  is  a limit  ordinal,  / e Vdom(/)+i-  Hence, 
if  a + 2 < /3  or  a is  a limit  ordinal,  then  Vp  |=  3f(fr\  A ^ )[«,  x].  In  the 
remaining  cases  argue  as  follows:  If  a is  a successor  ordinal  and  either  a + 2 = ji 
or  else  a + 1 = /3  and  a — 1 is  a limit  ordinal,  then  there  is  an  / e Vp  with 
dom(/)  = a — 1 satisfying  fi,  and  so  Vp  1=  3 fifi  A t//3  ) [ a,  x].  Finally,  if  a is 
a successor  ordinal,  a + 1 = /3,  and  a — 1 is  a successor  ordinal,  then  there  is  an 
/ e Vp  with  dom(/)  = a — 2 satisfying  i/q,  and  so  Vp  \=  3/(^i  A Va )[a,  x],  ~\ 
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23.3  Proposition.  If  k is  q-extendible  and  0 < S < ij,  then  k is  8-extendible. 

Proof.  Let  j:  VK+r]  < witness  the  ^-extendibility  of  at.  It  follows  from  the 
lemma  that  j(VK+$)  — Vj(K+s).  Hence,  formulas  can  be  relativized  to  show  that 
j | VK+s : VK+s  -<  Vj(K+s),  confirming  5-extendibility:  For  any  formula  cp(v\ , . . . ,v„) 
and  sets  x\, ... , x„  in  VK+$, 

Vk+8  \—  ? • • * . %n]  Iff  Vk+i]  I — T [-1 1 . • • • . -L?] 

ff  V(  |=  <pVj<‘+»  [j  Oi j (*„)] 

iff  VHk+S)  |=  cp[j(x i),  . . . , j(xn )]  . H 

While  it  is  consistent  with  the  supercompactness  of  k that  there  is  no  inac- 
cessible cardinal  above  k (22.8),  the  extendibility  of  k implies  the  existence  of 
many  large  cardinal  above  k:  If  j:  VK+\  -<  V/m+i  witnesses  the  1 -extendibility 
of  k,  then  k is  measurable  so  that  j(K)  is  inaccessible  in  Vj(K)+t  by  elementar- 
ity  and  hence  inaccessible  in  V as  V{ j(k))  c Vj(K)+\ . Moreover,  by  23.1  and 
elementarily  {a  < j (k)  \ a is  measurable}  is  unbounded  in  j (k).  In  turn,  if 
j:  Vk+2  -<  Vj (k)+2  witnesses  the  2-extendibility  of  at,  then  j (k)  is  itself  measur- 
able in  V as  V(V(j  («■)))  c y,-(K)+2.  In  these  ways,  extendibility  projects  stronger 
and  strong  hypotheses  upward.  These  considerations  illustrate  how  strongly  the 
existence  of  an  extendible  cardinal  affects  the  higher  levels  of  the  cumulative  hier- 
archy, and  why  ^-extendibility  cannot  be  formulated,  as  y -supercompactness  can, 
in  terms  of  the  existence  of  ultrafilters. 

Extendibility  is  next  characterized  in  terms  of  the  higher-order  analogue  of 
the  original  formulation  of  strong  compactness.  This  result  of  Magidor  reinforces 
extendibility  by  explicating  it  as  a direct  generalization  of  a prior  concept.  The 
Lxf_L  languages  were  defined  in  §4;  for  1 < n e co,  an  L’ft  language  is  an  extension 
of  an  Lxjx  language  allowing  higher-order  variables  of  type  at  most  n.  With  the 
intended  interpretation  of  type  i + 1 variables  as  ranging  over  V (D)  where  D is 
the  domain,  there  are  the  corresponding  notions  of  satisfiability  and  v -satisfiability 

as  for  LU  = L^- 

k is  L'^-compact  iff  any  collection  of  L"  sentences, 
if  v-satisfiable,  is  satisfiable  . 

Thus,  k > co  and  k is  LlKK- compact  iff  k is  strongly  compact. 

23.4  Theorem  (Magidor  [71a]).  The  following  are  equivalent  for  k > co: 

(a)  a:  is  extendible. 

(b)  k is  LnKK-compact  for  every  1 < n e co. 

(c)  k is  L2Klo-compact. 

Proof.  Suppose  first  that  k is  extendible,  1 < n e co,  and  £ is  a k -satisfiable 
collection  of  L'lK  sentences.  It  can  be  assumed  that  £ is  coded  as  a set  via  an 
arithmetization  of  the  language  where  the  logical  symbols  are  coded  by  members 
of  VK.  Let  q > k ■ co  be  a limit  ordinal  sufficiently  large  so  that  |Zj  < ;/  and 
V,t  1=  £ is  AT-satisfiable.  By  q -extendibility,  let  j:  Vv  -<  with  crifiy)  = k and 
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tj  < j (k).  Then  Vj  |=  j(E)  is  yfiO-satisfiable.  Note  that  j“E  c j{E)  and  so 
j“E  e Vf , and  that  \j“E\  < rj  < j(K ) and  so  this  holds  in  as  £ must  be  a limit 
ordinal.  Hence,  \=  j“E  has  a model  A,  in  which  case  A really  is  a model  of 
j“E.  But  then,  A is  a model  of  E,  since  the  formulas  in  j“E  are  like  those  of  E 
with  at  most  the  non-logical  symbols  renamed.  It  follows  that  k is  L"K  -compact. 

To  complete  the  proof,  it  remains  to  show  that  if  k is  L2Klo -compact,  then  k 
is  extendible.  Suppose  then  that  ij  > k ■ <u;  we  seek  a £ and  a j : Vn  -<  Vj-  with 
crit( j)  — k and  1 7 < j{k).  In  terms  of  distinct  constants  x for  x e Vn  and  ca 
where  a < 17,  let  E be  the  L2K0)  theory  of 

(V,,  c,  x)x€vrl 

together  with  the  sentences  ^ca  e ic  A ca  e c ^ for  a < < i].  Then  E is 
/r-satisfiable,  so  by  hypothesis  it  is  satisfiable.  Now  E contains  a 77i  sentence  a 
so  that  for  any  transitive  set  X,  X |=  cr  iff  X = Va  for  some  a (cf.  0.2),  and  a TJ\ 
sentence  r asserting  that  e is  well-founded.  Hence,  taking  a transitive  collapse  E 
has  a model  of  form 

, G,  X,  Yci) xeVq&Sri  > 

where  ya  interprets  ca.  Clearly,  the  map  j\  V , — >■  Vj-  defined  by  j(x)  = x is 
elementary.  Also,  for  any  a < k,  E contains  the  LKa)  sentence 

Vv(v  e a v = f)  . 

so  that  by  induction  j is  the  identity  on  k.  Finally,  ;;  < jOO  since  {ya  \ a < r/} 
is  a subset  of  / (at)  of  ordertype  ij  + 1 > r],  H 

Although  extendibility  reflects  more  ethereal  ambitions  than  supercompact- 
ness, the  next  two  results  show  how  their  local  versions  are  closely  intertwined. 

23.5  Exercise.  If  k is  \VK+tl\-supercompact  and  j]  < k,  then  there  is  a normal 
ultrafilter  U over  k such  that 

{a  < k | a is  ?; -extendible}  e U . 

Hint.  Argue  as  for  the  forward  direction  of  22.10.  H 

23.6  Proposition.  If  k is  rj-extendible  and  S + 2 < rj,  then  k is  Vk+f,\- 
supercompact.  Hence,  if  k is  extendible,  then  k is  supercompact. 

Proof.  Let  j:  VK+t]  -<  witness  the  ^-extendibility  of  k.  Noting  that 
V(VKVK+S)  c Vk+s+2  C Vn,  define  U by: 

XeU  iff  X c VKVK+S  a j“VK+s  e j(X)  . 

Then  it  is  straightforward  to  check  that  this  is  a normal  ultrafilter  over  VKVK+$, 
provided  that  enough  apparatus  can  be  coded  into  the  domain  of  j.  This  however 
is  routine  to  check: 
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To  verify,  for  example,  the  choice  function  condition  (cf.  22.6),  suppose  that 
/ is  a choice  function  on  VKVK+$.  For  each  y e VK+$,  set 

J+  = (y  n (' VK+s  - &>))  U {n  + 1 I n e y (T  co] 

so  that  0 f y.  Then  for  each  x e VKVK+$,  set 

xf  = {y+  |yei)U  {{0}  U /(*)+} 

so  that  0 tags  fix).  Clearly  Xf  e Vk+s+i.  Finally,  set 

X/  = {xf  | x e VKVK+&] 

so  that  Xf  e VK+$+ 2.  Hence,  Xf  e Vr)  and  faithfully  codes  /,  and  can  be  used  in 
conjunction  with  the  elementarity  of  j to  verify  the  choice  function  condition  for 

/•  H 


That  an  extendible  cardinal  is  supercompact  can  also  be  derived  along  the 
lines  of  22.10. 

The  conjunction  of  even  1 -extendibility  and  supercompactness  transcends 
supercompactness : 

23.7  Proposition.  If  k is  supercompact  and  l-extendible,  then  there  is  a normal 
ultrafilter  U over  K such  that 

{a  < k | a is  supercompact}  e U . 

In  particular,  the  least  supercompact  cardinal  is  not  l-extendible. 

Proof.  Let  j : VK+\  < VpK)+\  witness  the  1 -extendibility  of  k,  and  U the  usual 
normal  ultrafilter  over  k defined  from  j.  Since  j (k)  is  inaccessible,  V/oo+i  N K 
is  y -supercompact  for  k < y < j(K).  Hence, 

A — {a  < k | a is  y -supercompact  for  a < y < k)  e U . 

But  v is  supercompact,  so  by  22.9  every  a e A is  supercompact.  H 

This  readily  leads  to  a rich  supply  of  cardinals  for  Menas’s  result  22.19: 

23.8  Exercise.  The  conclusion  of  23. 7 can  be  strengthened  to: 

{a  < k | a is  supercompact  and  a limit  of  supercompact  cardinals}  e U . H 


On  the  other  hand,  supercompactness  reins  back  its  stronger  cousin  in  the 
following  sense: 

23.9  Exercise. 

(a)  If  k < X,  k is  extendible,  and  X is  supercompact,  then  V\  (=  k is  ex- 
tendible. 
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(b)  If  Con(ZFC  + 3 k(k  is  extendible)),  then  Con(ZFC  + 3 k(k  is  extendible 
A -i3A(A  > k A A.  is  supercompact))). 


Hint.  For  (a),  remembering  that  f = Vf  is  77jZF 
773zf  and  apply  22.3.  For  (b),  use  (a)  and  22.9. 


check  that  f is  extendible1  is 

H 


Hence,  while  extendibility  may  imply  the  existence  of  proper  classes  of  var- 
ious large  cardinals,  this  does  not  encompass  supercompact  cardinals. 

Finally,  the  next  result  is  analogous  to  22.3: 


23.10  Proposition.  If  k is  extendible,  then  VK  -<3  V. 

Proof  Suppose  that  <p  is  -£3,  say  3vof(vo,  tq)  where  f is  II2  for  simplicity,  and 
x e VK.  It  follows  from  22.3  that  if  VK  |=  <p[x],  then  (p[x~\.  For  the  converse, 
we  only  need  the  simple  corollary  of  extendibility  that  there  are  arbitrarily  large 
inaccessible  X > k such  that  VK  -<  V\\ 

Assume  that  <p[x],  and  let  y be  such  that  1 jr[y,  x].  Let  X > k be  inaccessible 
such  that  VK  -<  V>_  and  y e Vk.  Since  f is  TL  and  Vk  = Hk,  by  the  downward 
direction  of  Levy’s  result  mentioned  before  22.3,  V>  [=  f[y,x].  But  then  V\  1= 
<p[x],  and  hence  VK  |=  cp\x ].  H 


23.11  Exercise.  If  k < X,  Vx  \=  k is  extendible,  and  X is  extendible,  then  k is 
extendible. 

Hint.  Argue  as  for  22.9,  using  f is  extendible1  is  77ZF.  H 

This  shows  that  23.10  is  optimal  in  the  sense  that  the  sentence  rthere  is 
an  extendible  cardinal1  does  not  hold  in  VK  if  k is  the  least  extendible  cardinal. 

These  various  results,  drawn  mainly  from  Solovay-Reinhardt-Kanamori  [78], 
show  how  extendibility  extends  supercompactness,  but  also  how  the  two  concepts 
are  closely  related.  Of  the  two,  supercompactness  has  become  a focal  hypothesis 
in  set  theory,  serving  as  the  beginning  hypothesis  in  several  relative  consistency 
results,  whereas  extendibility  has  receded  from  view.  The  simple  23.5,  showing 
how  local  extendibility  is  a consequence  of  local  supercompactness,  may  provide 
the  underlying  reason. 


Proofs  of  a Theorem  of  Kunen 

At  the  end  of  his  thesis  Reinhardt  briefly  considered  the  following  axiom  as 
a strong  expression  of  ideas  for  extending  Ackermann’s  set  theory:  There  is 
a j:  V -<  V.  As  noted  in  his  [74a:  200],  this  is  also  suggested  as  an  ex- 
tension of  extendibility:  For  such  a j with  crit(_/')  = k and  any  ordinal  17, 
j\VK+ri:  VK+ri  -<  Vj(K+rl),  and  so  save  for  a small  quibble  about  rj  < j(K),  it 
is  as  if  one  j works  for  all  17.  Although  not  the  original  motivation,  j:  V <V  is 
most  immediately  suggested  as  the  ultimate  possibility  in  terms  of  closure  condi- 
tions on  target  inner  models,  proceeding  progressively  from  measurability  through 
degrees  of  supercompactness. 
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Reinhardt’s  proposal  led  to  a dramatic  turn  of  events.  After  initial  results 
aroused  some  suspicion,  Kunen  established  that  in  ZFC  there  can  be  no 
j:  V < V: 

23.12  Theorem  (Kunen  [71b]).  Suppose  that  j:  V < M.  Then  M / V . 

As  the  quantification  V j over  classes  j cannot  be  formalized  in  ZFC,  this 
result  can  only  be  regarded  as  a schema  of  theorems,  one  for  each  j . Kunen’s 
argument  does  lead  to  a simple  assertion  about  sets  (23.14(a)). 

23.12  has  delimited  the  whole  large  cardinal  enterprise.  It  could  have  been 
that  j:  V -<  V would  serve  as  the  culmination  of  the  guiding  idea  of  closure 
conditions  on  range  models  of  elementary  embeddings;  a new  guiding  idea  in 
some  orthogonal  direction  would  have  been  exploited  to  formulate  still  stronger 
hypotheses;  and  so  on.  Rather,  in  quickly  resolving  the  situation  with  an  appro- 
priately simple  statement,  Kunen’s  result  sharply  defined  the  context  and  showed 
that  a completion  of  ZFC  in  a specific  sense  exists.  The  particular  form  of  the 
result  was  intriguing  and  unexpected,  and  although  the  original  proof  has  an  ad 
hoc  flavor,  what  it  established  has  not  since  been  superceded  by  any  stronger 
inconsistency  result. 

Several  proofs  of  23.12  are  given.  The  first  is  that  original  proof,  which 
applied  a combinatorial  result  of  Erdos-Hajnal  [66],  The  result  is  of  independent 
interest  in  connection  with  Jonsson  cardinals  (§8)  since  it  shows  that  if  infinitary 
operations  are  allowed,  then  there  are  Jonsson  algebras  of  every  infinite  cardinality. 
For  any  set  x of  ordinals  and  function  /, 

/ is  ca-Jdnsson  for  x iff  f:  [x]®  — > x and  for  any  y C x 
with  |y|  = |x|,  /“[y]®  = x . 

There  is  a function  w-Jonsson  for  x iff  there  is  a function  cu-Jonsson  for  \x\,  and 
for  cardinals  k,  there  is  a function  co-Jonsson  for  k iff  k— />  [ic]®  in  the  notation  of 
§8.  The  following  proof  is  drawn  from  Galvin-Prikry  [76]  and  has  an  antecedent 
in  Galvin  [65]. 

23.13  Theorem  (Erdos-Hajnal  [66]).  For  any  X,  there  is  a function  co-Jonsson  for 
X. 

Proof.  For  x,y  e [k]®  set  x ~ y iff  they  have  equal  “tails”,  i.e.  for  some 
ci  < Ur,  x — a — y — a.  Then  ~ is  an  equivalence  relation  on  [A.]®,  so  for  each 
equivalence  class  E choose  a representative  xe  e E.  (This  is  the  use  of  AC  in 
this  proof.)  Now  for  any  x e [A.]®,  with  E the  equivalence  class  of  x and  a e xe 
minimal  such  that  xE  — (a  + 1)  = x — (a  + 1 ),  set  g(x)  — a. 

To  complete  the  proof,  it  suffices  to  find  an  A e [/.]'  such  that  for  any  B e 
[A]1,  g“[fi]®  2 A,  since  for  such  an  A a function  ®-Jonsson  for  A can  be  readily 
derived  from  g.  So,  assume  to  the  contrary  that  no  such  A exists.  Then  for  n e u> 
there  are  A„  e [A.]x  and  an  e X with  A„  I)  An+\  and  an+ 1 e A„  — (a„  + 1)  such  that 
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an+ 1 ^ g“[A„+ 1]®.  Set  y = [a„  | n € to)  6 [A.]®.  With  E the  equivalence  class  of 
y let  m e co  be  such  that  for  some  a < V3xe,  [an  \ m < n < co}  = Xe  — a.  But 
then,  g({«„  | m < n < co})  = am  so  that  am  e g“[Am]w,  which  is  a contradiction. 

H 


First  Proof  of  23.12  (Kunen  [71b]).  Let  k = crit(y),  and  jn  denote  the  nth 
iteration  of  j,  i.e.  for  any  x e V,  j°(x)  = x and  jn+1(x)  = j(jn(x))  for  n e co. 
Set  A = sup({y"  (k)  | n e co})  so  that  A is  the  least  ordinal  8 above  k such  that 
j(8 ) = 8.  To  conclude  that  M f V,  it  is  established  that  j“ A f M\ 

Assume  to  the  contrary  that  j“ A e M , and  let  / be  cu-Jonsson  for  A.  In  M, 
j(f)  is  &>-Jonsson  for  j (A)  = A,  and  /“A  e [A]1  D M.  However,  we  shall  check 
that  J (/)“[./“ A]®  c j“ A,  which  is  a contradiction  since  j“ A f A (for  example, 
k e A - j“ A). 

Suppose  then  that  ^ e [j“A]®.  Clearly  there  is  a f e [A]®  such  that  j[t)  = 

j“t  = s.  But  then,  j(f)(s)  = j(f)j(t ) = j(f(t))  e j“ A.  H 

As  Kunen  observed,  since  A is  a strong  limit  cardinal  of  cofinality  co,  2l  = AK°, 
and  to  establish  23.13  for  such  A there  is  a simple  recursive  construction  available 
(albeit  one  that  uses  a bit  more  of  AC):  Let  {(.Vo.,  /„)  | a < 2X)  enumerate  [A]1  x A. 

For  a < 2}  recursively  choose  sa  e [xa}m  so  that  sa  f sp  for  ft  < a;  this  is 

possible  since  2X  = AK°.  Then  any  function  /:  [A]®  -*  A such  that  f(sa)  — ya  is 
cu-Jonsson  for  A. 

Other  proofs  of  23.12  appeared  in  the  late  1980’s.  One,  due  to  Woodin, 
depends  only  on  partitioning  stationary  sets  into  disjoint  stationary  subsets: 

Second  Proof  of  23.12  (Woodin).  Let  k — crit(y')  and  A = sup({y"(^)  I n e co}) 
as  before.  By  Solovay’s  16.9,  there  is  an  S:  k -*  "P(A+)  such  that  ran(5)  is  a 
partition  of  {£  < A+  | cf(§)  = co}  into  sets  stationary  in  A+.  (In  fact,  because 
of  the  nature  of  the  set  to  be  partitioned,  a recursive  definition  of  such  an  S is 
possible.)  Since  j( A)  = A,  A+  < j(X+ ) = A+M  < A+  so  that  equality  pervades. 
Hence,  by  elementarity  j(S ):  j (x)  —>  Vif+)  and 

(j  (S)(k)  C {£  < A+  | cf(f)  = co}  is  stationary  in  A+)M  . 

Assume  to  the  contrary  that  j (S)(k)  is  really  stationary  in  A+,  a consequence  of 
M — V . Then  by  A+ -completeness  j(S)(K)  fl  S(a o)  is  stationary  in  A+  for  some 
ag  < k.  Also, 

C = {$  < A+  | ;(§)  = | A cf(t)  = co} 

is  readily  seen  to  be  w-closed  unbounded  in  A+.  It  follows  (0.1(d))  that  there  is  a 
to  e U(S)(k)  n S(a0))  n C.  But  then,  £0  = j(?o)  e j(S(a0))  = j(S)(a0 ) so  that 
§o  € j(S)(i<)  fl  j(S)(ao),  which  is  a contradiction  since  by  elementarity  the  range 
of  j ( S ) must  consist  of  pairwise  disjoint  sets.  H 

Todorcevic  observed  that  the  consideration  of  C is  unnecessary:  Assume 
as  above  that  j(S)(K)  is  really  stationary  in  A+.  Since  /“ A+  is  unbounded  in 
;(A+)  = A+, 
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C = {H  < X+  | 3s(s  e [j“l+T  a $ = sup(,y))} 

is  a>-closed  unbounded,  so  let  $o  e j(S)(/c)  fl  C with  s e [j“X+]a>  such  that 
$o  = sup(s).  There  is  a t e [k+]£U  such  that  j ( t ) = s,  and  sup(f)  e S(qto)  for  some 
cuo  < k.  But  j( sup(f))  = £o  e j(S)(ic),  and  so  j(a0)  — k by  elementarity,  which 
is  a contradiction. 

In  fact,  for  regular  /x  > a>,  with  S:  \i  ->  V(pt)  such  that  ran(5)  is  a partition 
of  {£  < /x  | cf(f)  = &>}  into  sets  stationary  in  /x,  /:  [/x]"  ->  /u  defined  by  /(s)  = 
that  unique  a such  that  sup(.v)  e 5(a)  is  a simple  example  due  to  Solovay  (see 
Galvin-Prikry  [76:  370])  of  a function  ft;-J6nsson  for  /x.  Consequently,  we  have  a 
reversion  to  Kunen’s  argument  through  an  explicit  function  of  &>-Jonsson  sort  for 
>-•  - ./(>■•). 

The  following  is  a simpler  version  of  a proof  of  23.12  due  to  Mikio  Harada; 
rather  than  depending  on  combinatorial  contingencies  it  reveals  a more  structural 
constraint. 

Third  Proof  of  23.12  (Harada).  Letting  X be  as  before,  assume  again  to  the  contrary 
that  j“ X e M. 

Note  first  that  for  any  bijection  g : 2X  -»•  V(X).  j“X  e V(X)M  — ran (j(g)). 
So,  let  ct  be  the  least  ordinal  such  that  for  some  injective  function  F:  a — »■  V(X), 
j“ X e ran (j(F)).  Fix  such  an  F,  and  set  S — ran(F).  Now  define  U by: 

X e U iff  X c 5 a f‘X  e j(X)  . 

Then  U is  an  to\ -complete  ultrafilter  over  S,  so  let 

i:  V < N = Ult(V,  U ) . 

Recalling  the  theory  of  normal  ultrafilters  from  §22,  if  id:  S — >■  S is  the 
identity  map,  the  definition  of  U implies  successively  that: 

(i)  i“ X — [id]f/  and  so  i“ X e ranl/lF))  , 

(ii)  a — [(ot(v  fl  a)  \ x e S)]j/  for  a < X , and 

(iii)  * I (X  + 1)  = j\(X+  1)  . 

(iii)  follows  by  arguments  for  22.12,  since  k:  N ->  M defined  by  k([f]u)  = 

j (/K j“X)  is  elementary;  j = koi;  and  k(a)  = a for  a < X by  (ii). 

The  switch  to  ultrapowers  secures  more  structure:  By  22.4(b),  i“X  e N 

implies  that  T(X)N  — V{X)  and  also  that  i \ V-,  6 W by  a simple  argument  as  X 

is  a strong  limit  cardinal.  Now  for  any  X e ViX),  i(X)  — Uo!<x,  (^  n “)  since 
i(X)  — X,  so  that  in  turn  i“V{ X)  = {Ua<x  i(X  fl  a)  \ X e V{X)}  e N.  Again  by 
22.4(b)  it  follows  that  N has  all  the  enumerations  of  V(X)N  — V{X),  and  hence 

(*)  2X  = (2(i) (ii) (iii) * * * * * * x)n  — i((2x))  . 


The  properties  of  o are  now  used  to  derive  a contradiction.  Clearly  the 
definitions  imply  that  ct  is  a cardinal  and  |.S]  = ct  < 2X.  22.4(c)  implies  that 
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i“er+  ^ N;  but  since  i“V(X)  e N as  noted  above,  it  follows  from  22.4(a)(b) 
that  2X  < ct+.  Hence,  it  must  be  that  cr  = 2f . But  then,  /(cr)  = a 
from  (*),  so  that  i (cr)  = sup(/“a).  Since  i“ X e ran(/(/;’))  by  (i),  it  fol- 
lows that  i“X  e ran(/(F)|i(a ))  = ran(/(F|a!))  for  some  a < a.  But  then, 
j“X  e ran(j  (F\a))  by  definition  of  U and  /,  contradicting  the  minimality  of  cr. 

-I 


These  proofs  of  23.12  lead  to  the  following  refinements: 

23.14  Corollary. 

(a)  For  any  8,  there  is  no  j:  Vs+2  -<  V&+ 2- 

(b)  If  j'.  V < M and  8 is  the  least  ordinal  above  crit(y)  such  that  j (8)  = 8, 
then  j“S  M. 

Proof,  (a)  Assume  to  the  contrary  that  there  is  such  a j . Clearly  j (8)  — 8,  and 
crit(y)  < 8.  (Elementary  embeddings  are  not  the  identity  on  their  domains  by 
convention,  and  the  argument  for  5.1(b)  implies  that  j must  move  some  ordinal.) 
As  before,  let  k — crit(j),  and  noting  that  each  iterate  j"  (k)  is  defined  and  less 
than  8,  set  X — sup({y"0r)  | n e w})  < 8.  Kunen’s  contradiction  can  now  be 
derived  since  the  function  w-Jonsson  for  X is  in  Vx+i,  and  hence  in  the  domain 

of  j- 

For  Woodin’s  proof,  note  that  subsets  of  /.+  can  be  faithfully  coded  by 
members  of  Vx+2 ■ For  example,  for  each  well-ordering  R c X x X of  X,  let 
aR  < X+  be  its  ordertype,  and  eR\  ( aR , <)  -»  (X,  R)  the  order-isomorphism.  Let 
p:  (X  x A.)  x X —>■  X be  a bijection,  noting  that  p C Vx-  Then  for  any  IQ+, 

{p“(R  x eR\X  n aR))  \ R is  a well-ordering  of  A}  e Vx+2 

can  serve  as  a code  for  X.  Short  sequences  of  subsets  of  X+  can  be  similarly  coded, 
and  it  is  straightforward  to  see  that  the  relevant  assertions  like  the  stationariness 
of  a subset  of  X+  can  be  expressed  in  Vx+2  in  terms  of  codes.  Hence,  Woodin’s 
contradiction  can  be  derived  via  codes  with  j \Vx+2- 

Harada’s  proof  can  also  be  carried  out  with  j | Vx+2,  since  every  well-ordering 
of  a subset  of  V{X)  can  be  coded  by  a member  of  Vx+2,  and  so  the  a and 
corresponding  F and  U c Vx+2  can  be  defined  via  such  codes. 

(b)  Note  that  if  X is  again  defined  as  before  from  crit(y').  then  8 — X.  Kunen’s 
and  Harada’s  proofs  thus  show  that  j“S  M.  To  push  through  Woodin’s  ar- 
gument, first  switch  from  j to  an  ultrapower  embedding  as  in  Harada’s  proof: 
Assuming  that  j“ X e M,  define  U over  V(X)  by 

XeU  iff  X<ZV(X)  a j“X  e j(X)  , 

and  let  i:  V ~<  N = Ult(V,  U).  Then  as  seen  before,  crit(i)  =crit(7X  OX)  = 
X,  i“X  e N , and  i“V( X)  e N.  Hence  c N,  and  Woodin’s  proof  can  be 
carried  out  via  codes  as  for  (a).  H 
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The  following  makes  good  on  a small  debt  concerning  the  definitions  of 
y -supercompactness  and  extendibility. 

23.15  Proposition. 

(a)  k is  y -supercompact  iff  there  is  a j:  V < M with  crit(y')  = k and  Y M C 
M. 

(b)  k is  extendible  iff for  any  r]  > k there  is  a j:  Vtl  < with  crit(j)  = k. 

Proof,  (a)  Suppose  that  j is  as  stated;  only  the  condition  y < j(ic ) is  missing. 
But  by  23.14(b),  y < sup ({/'(«■)  | n e &>}),  and  hence  y < jn(K ) for  some  n e co. 
Hence,  it  suffices  to  establish 

(*)  For  every  n e w with  n > 0 there  is  an  inner  model 

Mn  of  ZFC  such  that  jn:  V -<  M„  and  yM„  c Mn  . 

To  this  end  first  extend  j to  classes  R by  stipulating  that 

y+(*)  = Uj(tfnv„). 

Note  that  for  classes  R\, . . .,  Rm, 

j:  (V,e,Ru...,Rm)  < (M.eJ+iRf j+(Rm)) 

(where  extending  a standing  convention,  -<  is  -<i  for  formalizability)  via  the  Re- 
flection Principle  for  ZF. 

(*)  can  now  be  established  by  induction  on  n.  For  n = 1 it  holds  with 
M\  — M.  Assume  now  that  it  holds  for  n,  and  set  Mn+\  — j+(M„).  Since 

j:  (V,  e,  j\  Mn)  < (M,  e,  j+(jn),  Mn+l)  , 

Mn+ 1 is  transitive,  and  j+( j"):  M -<  Mn+\  so  that  j+(jn)  o j:  V -<  Mn+\.  But 
j+(jn)  3 {(j(x)Jn+\x))  | x e R>},  and  so  j+(j")  o j = jn+1.  Finally,  since 
(V,  e,  Mn)  < (M,  e,  Mn+1)  and  Y Mn  c Mn,  j(r)Mn+1  n M c Mn+U  and  so 

YMn+ 1 = Y Mn+\  n M c Mn+\  . 

(b)  It  suffices  to  show  from  the  latter  assertion  that,  taking  ?/  > k ■ co,  k is 
?7-extendible  - only  the  condition  r\  < j (k)  is  missing.  For  this  purpose,  let  y > i] 
be  such  that 

(i)  whenever  (3  < y and  for  some  t;  there  is  a k:  Vn  -<  with 
crit(k)  = k and  k(K)  = f,  there  is  such  a k with  a f < y . 

(ii)  cf (y)  = m i . 

Such  a y exists  by  a simple  closure  argument  iterated  co\  times.  By  hypothesis, 
let  j:  Vy  -<  Vp  with  crit(j)  = k.  If  7" (at)  were  defined  for  every  n e co,  then 
sup ({/'(k)  I n e w})  < y as  cf(y)  = ay , and  we  would  be  able  to  derive  Kunen’s 
contradiction.  Hence,  it  follows  that  there  is  an  m e co  such  that  j"'(K)  < y < 
jm+\K). 
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Let  P(i)  be  the  assertion: 

There  is  an  e : Vrj  -<  Vj  for  some  £ with  crit(e)  = k and  e(/e)  = jl+l(jc)  • 

It  suffices  to  establish  P(m),  for  then  ?/-extendibility  would  follow  from  ?;  < Y < 
j'n+l(K).  To  this  end,  P(i)  is  established  for  every  i < m by  induction  on  i: 

Set  j — j\VT  Then  as  for  23.3,  j:  Vn  -<  Vj(rl)  with  crit(y)  = k and  J(k)  = 
j{k),  and  P( 0)  holds.  Now  assume  P(i)  where  i < m.  Then  because  jl+l{tc)  < y 
it  follows  from  (i)  that  for  some  £ < y there  is  an  k : Vn  -<  V, \ with  crit(A:)  = k 
and  k(K)  — j'+1(K).  j can  therefore  by  applied  to  k to  conclude  by  23.2  and 
elementarily  that  in  Vp,  and  hence  in  V , there  is  an  k:  Vj(v)  -<  with  crit(A:)  = 
j(K)  and  k(j(K ))  = /+2(at).  But  then,  with  j as  above,  k o j:  V,t  -<  V,-^)  with 
crit(L  o j)  = k and  k o j'(at)  = y',+2(/r).  Hence,  P(i  + 1)  obtains,  and  the  proof  is 
complete.  H 

Because  of  its  obvious  import,  Kunen’s  result  quickly  came  under  close 
scrutiny.  A prominent  feature  of  the  proofs  is  the  appeal  to  the  Axiom  of  Choice 
at  the  level  of  a well-ordering  of  V(X).  This  is  a crucial  factor  in  getting  an  co- 
Jonsson  function  for  A in  the  first  proof  (actually,  a well-ordering  of  [k]“  suffices 
for  the  given  proof  of  23.13),  in  splitting  {£  < X+  \ cf(§)  = &>}  into  k stationary 
subsets  of  X+  (through  their  codes  in  V(X)),  and  in  arguing  for  the  existence  of 
a in  the  third.  The  following  unresolved  question  is  therefore  of  foundational 
interest: 

23.16  Question.  Is  it  provable  in  ZF  that  there  is  no  j:  V < V ? 

Pending  an  answer,  Kunen’s  result  can  best  be  viewed  as  an  ultimate  lim- 
itation imposed  by  the  Axiom  of  Choice  on  the  extent  of  reflection  possible  in 
the  universe.  ZFC  rallies  at  last  to  force  a veritable  Gdtterdammerung  for  large 
cardinals! 
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24.  The  Strongest  Hypotheses 

This  section  describes  the  strongest  large  cardinal  hypotheses  not  known  to  be  in- 
consistent, starting  with  those  on  the  verge  of  Kunen’s  result  and  then  n -hugeness 
and  Vopenka’s  Principle.  Like  the  development  of  set  theory  after  the  discovery  of 
the  antinomies,  there  was  a stepping  back  from  the  precipice  of  Kunen’s  inconsis- 
tency, a charting  out  of  possibilities  that  remained,  and  with  the  passage  of  time, 
a growing  confidence  in  the  delimited  edifice.  However,  unlike  the  emergence  of 
the  cumulative  hierarchy  and  other  guiding  ideas  that  provided  intuitive  underpin- 
nings for  ZFC,  it  is  doubtful  that  even  heuristic  arguments  can  be  put  forward  for 
the  optimality  of  Kunen’s  result,  because  of  its  basis  in  a specific  mathematical 
contingency.  The  strongest  hypotheses  thus  stand  on  much  shakier  ground,  but 
their  study  has  a natural  appeal  owing  to  the  power  and  simplicity  of  the  concepts 
involved  as  well  as  the  possibility  of  some  new  apocalyptic  inconsistency. 

The  following  propositions  just  weaker  than  those  denied  by  23.14  were 
considered  in  Gaifman  [74]  and  Solovay-Reinhardt-Kanamori  [78]: 

11.  For  some  S,  there  is  a j:  Vg+\  -<  V5+1  . 

12.  There  is  a j:  V -<  M such  that  Vg  c M 
for  some  <5  > crit(y')  satisfying  j(8 ) = 8 . 

13.  For  some  8,  there  is  a j:  Vg  <Vg  . 

The  class  assertion  12  is  formalizable  through  a coming  set  characterization. 
A proposition  yet  stronger  than  all  of  these,  10,  will  be  mentioned  shortly. 

These  propositions  have  thus  far  defied  all  attempts  at  refutation  in  ZFC. 
It  is  clear  from  the  proof  of  23.14  that  if  in  each  case  k — crit(j)  and  A — 
sup ({jn(ic)  I n e a> })  as  before,  then  necessarily  8 is  X,  except  that  for  13  8 could 
also  be  X + 1 to  yield  the  stronger  II.  One  attempt  to  refute  II  would  be  to  get  a 
function  in  V-,+ 1 that  would  play  the  role  of  the  function  w- Jo ns son  for  X.  Since 
X is  a limit  of  measurable  cardinals,  namely  the  y"(/c)’ s,  X is  Jonsson  (8.7)  so  that 
[A.]"  cannot  be  replaced  by  [A]<<w  in  /:  [A]"  — > A.  However,  a positive  answer  to 
the  following  would  refute  II. 

24.1  Question  (Magidor).  For  strong  limit  A is  there  an  f : UvcaM*  — 1 y ^ su°h 
that  whenever  y e [A]\  /“(Uv<A.tu  ^ >’]“)  = 

That  is,  is  there  an  analogue  of  the  function  w-Jonsson  for  A with  domain 
consisting  of  the  subsets  of  A of  ordertype  a>  that  are  bounded  below  A?  Of 
course,  such  an  analogue  would  be  in  V>  + 1 . Masahiro  Shioya  [93]  has  shown  that 
a positive  answer  to  24.1  would  also  refute  12. 

Some  results  about  11-13  are  now  established,  and  later  some  recent  work 
described  in  light  of  which  these  propositions  might  be  approached  with  some 
confidence.  Recalling  the  proof  of  23.15(a),  for  j:  Vg  -<  Vg  with  8 a limit  ordinal 
define  an  extension  j+:  V5+1  — > V5+1  by  stipulating  for  R c Vg  that 

;+W  = U.<l;(*nv») . 
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Since  Vg  x Vg  c Vg,  j+(R)  is  thus  defined  for  relations  R on  Vs,  functions 
R:  Vg  —>  Vs,  and  the  like.  j+  is  not  generally  elementary,  but  significantly,  it  is 
the  only  possibility  for  an  elementary  extension  of  j to  domain  Vg+ 1 • 

The  following  characterization  of  12,  evident  from  Gaifman  [74:  89]  or  Powell 
[74:  129],  clarifies  its  relationship  to  II  and  13: 

24.2  Proposition.  The  following  are  equivalent  for  any  k and  8: 

(a)  There  is  a j:  V -<  M with  crit (j)  = k < 8 = j(S)  and  Vs  c M. 

(b)  There  is  a j:  Vg  -<  Vg  with  crit(y)  = k such  that  whenever  R is  a well- 
founded  relation  on  Vg,  so  is  j+(R). 

Proof  For  the  forward  direction,  set  j = j\Vs . Then  j : Vg  -<  Vg,  and  for  any 
R c Vg,  j+(R)  — j (R).  If  moreover  R is  a well-founded  relation,  then  A7  |=  j ( R ) 
is  well-founded.  But  then,  by  absoluteness  (0.3)  j(R)  is  well-founded  in  V. 

For  the  converse  direction,  the  idea  is  that  although  no  ultrapower  embed- 
ding can  have  the  requisite  properties,  we  can  take  a direct  limit  of  ultrapower 
embeddings:  For  n e co,  set  k„  — jn(K ) and  define  \Jn  by: 

XeUn  iff  X c V{k„)  a f‘Kn  e j(X)  . 

Noting  that  j“Kn  e V(k„+\)  and  X c V(k„)  implies  that  j(X)  c V(Kn+ 1),  it  is 
simple  to  check  that  Un  is  a k -complete  ultrafilter  over  V(Kn).  Set 

in.  v <Mn=  Ult(V,  Un)  . 

Recalling  the  theory  of  normal  ultrafilters  in  §22,  with  id,,:  V(Kn)  ->  V{k„)  the 
identity  map, 

(i)  i,fK„  = [id„]f/„  , and  so 

(ii)  a — [(ot(v  fla)  | j e V(Kn))]u„  for  a < k„  , 

(iii)  crit(7„)  = k , and 

(iv)  K„  c Mn  . 

For  (iii),  note  that  by  the  representation  of  k given  by  (ii)  and  the  definition  of 
U„,  k < in{K)  iff  ot (j“Kn  fl  < j ( k ),  and  the  latter  holds  since  the  ordertype 

is  k.  (iv)  follows  from  (i)  and  22.4(a)  by  an  inductive  argument,  since  Kn  is 
inaccessible. 

Next,  for  n < m < co  define  knm:  M„  ->  Mm  by: 

knm([f]u„)  = [</(*  n*rB)  | x e V(icm))]Um  . 

Then  recalling  22.12  and  subsequent  remarks, 

(v)  knm:  M„  -<  Mm  , and 

(vi)  knm  (x)  = x for  every  x e VKn  . 

For  (vi),  knm{a)  — a for  every  a < k„  by  (ii),  and  so  with  (iv)  the  rank  argument 
for  5.1(b)  can  be  used. 

Finally,  as  {{( Mn , e)  \ n e co),  (knm  \ n < in))  is  seen  to  be  a directed  system, 
let  (M,  Gw)  be  its  direct  limit.  For  n e co  let 
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kn  : { M„ , e)  -<  (M,  eM> 

be  the  corresponding  embedding  so  that  km  o knm  — kn  for  n < m , and  let 

i:  ( V , e>  -<  (M,  eM> 

so  that  kn  o in  = i.  (M,  eM>  is  well-founded: 

Assume  to  the  contrary  that  there  are  zr  e M such  that  zr+ 1 OnM 

for  every  r e a>.  For  such  r,  let  K„r  be  such  that  K„r  < Kllr+l , and  gr:  V(K„r)  -*  On 
such  that  k„r([gr ])  = zr.  Since  | [J(.€(i)ran(g,.)|  < 8,  there  is  a well-ordering  R 
of  <5  and  an  order-preserving  injection  p:  (U(.€(B  ran(g,  ),  <}  — »■  (S,  R).  It  is  now 
straightforward  to  check  from  the  definitions  of  the  £/„’ s and  knm’s  that  for  every 
n e cu, 

(j(P  ° gr+l)(j“K„r+1),  j(p  O gr)U“K„r))  € j+ (R)  ■ 

But  then,  j+(R)  is  ill-founded,  contradicting  a key  hypothesis  on  j. 

M can  now  be  identified  with  its  transitive  collapse,  and  with  e.  The 
proof  is  complete,  since  crit(f')  = k by  (iii)  and  Vj  c M by  (iv)  and  (vi).  H 

This  characterization  leads  to  the  following  transcendence  results  of  II  over 
12  and  12  over  13.  For  convenience,  temporarily  let  II (k,  S)  be  the  proposition 
that  there  is  a j:  V^+i  -<  V^+i  with  crit(j')  = k,  and  analogously  define  12(k,8) 
(in  terms  of  the  above  characterization  for  formalizability)  and  I3(/c,  S). 

24.3  Proposition.  Suppose  that  II (at,  S).  Then: 

(a)  (Gaifman  [74:  89])  12 (k,  S). 

(b)  There  is  a normal  ultrafilter  U over  k such  that 

{a  < k | I2(a,  (5)}  e U . 

Proof.  Suppose  that  j:  Vs+ 1 -<  Vj+i  with  crit(y)  = k.  For  (a),  set  j — j\V$. 
Then  j:  Vs  -<  Vs  by  the  argument  for  23.3,  and  for  any  R C Vs,  J+  (R)  — j (R ). 
Moreover,  R is  a well-founded  relation  exactly  when  it  is  so  in  the  sense  of  Vs+ 1, 
since  V^+i  contains  all  functions:  a>  -»  Vs-  Hence,  the  well-foundedness  of  R 
implies  the  well-foundedness  of  j(R)  by  elementarity. 

For  (b),  note  that  the  above  argument  shows  that  V^+i  |=  12 (at , 8)  since  there 
is  an  S e Vj+i  coding  the  satisfaction  relation  for  Vs-  Hence,  the  result  follows 
by  the  usual  reflection  argument  using  the  normal  ultrafilter  over  k defined  from 

j-  H 

24.4  Proposition.  Suppose  that  12 (k,  8).  Then: 

(a)  I3(at,  5). 

(b)  There  is  a normal  ultrafilter  U over  k such  that 

{or  < a:  | I3(a,  5)}  e U . 

Proof,  (a)  is  immediate  in  terms  of  24.2.  For  (b),  first  let  j\  V < M with 
crit(j)  — k < 8 = j(8)  and  Vs  c M as  in  that  characterization.  It  suffices,  by 
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the  usual  reflection  argument  using  the  normal  ultrafilter  over  k defined  from  j, 
to  show  that 

M f=  3/(7:  Vs  -<  Vs  A crit(/)  = k)  . 

It  cannot  be  asserted  that  j\Vs  e M,  but  a well-foundedness  argument  suffices: 
Set 


I = {i  \ i:  Va<  Vp  for  some  a < f < S and  crit(/)  = k}  . 

Noting  that  by  elementarity  the  cofinality  of  S in  M is  also  co,  fix  a sequence 
{yn  n e to)  e M cofinal  in  <5.  Now  define  < on  I as  follows:  For  i,  i'  e I,  say 
with  i:  Va  ~<  V^  and  /':  Va>  -<  Vp/, 

i < /'  iff  i 3 /'  A 3 n e co(yn  e dom(/)  — dom(/'))  ■ 

Then  let  <*  be  the  reflexive,  transitive  closure  of  <,  i.e.  that  partial  order  in  which 
i'  is  an  immediate  successor  of  / exactly  when  i < /'. 

Since  Vs  c M and  ( yn  | n e co)  e M,  IM  = I and  <*M  — <*.  If  <*  is 
ill-founded  in  M,  then  the  union  of  any  infinite  <*-descending  sequence  would  be 
an  i:  Vs  ■<  Vs  as  desired.  So,  assume  to  the  contrary  that  <*  is  well-founded  in  M. 
But  then,  by  absoluteness  (0.3)  <*  is  well-founded  in  V.  This  is  a contradiction, 
since  some  subsequence  of  (j  |V,-»+i(K)|  n e co)  is  an  infinite  <*-descending  chain. 

H 


24.3(b)  and  24.4(b)  are  simple  reflection  observations  focusing  on  the  critical 
point  k.  For  reflection  on  the  S,  Solovay  (see  Gaifman  [74:  92])  observed  that  if 
I2(v,  S ),  then  for  some  k'  < S1  < k,  13(/r',  S').  As  part  of  a latter-day,  detailed 
analysis  of  11-13  through  definability,  Laver  [97]  established  that  if  1 1 (at,  S),  then 
for  some  k'  < S'  < k,  12 (at'.  S'). 

The  following  is  an  observation  concerning  13.  For  j : Vs  < Vs  with  S a limit 
ordinal,  j had  been  extended  to  a j+:  Vs+\  ->  Vs+\  by  stipulating  for  R c Vs  that 
j+(R)  — Ua<i  j ( R n Va).  What  about  taking  R itself  to  be  a map  k:  Vs  -<  Vfi 

24.5  Exercise.  Suppose  that  j:  Vs  -<  Vs  and  k:  Vs  -<  Vs.  Then  j+(k ):  Vs  -<  Vs, 
and  cht(j+(k))  = j(crit(k)). 

Hint.  One  way  to  show  that  j+(k)  is  elementary  is  to  replace  quantifiers  by 
Skolem  functions  and  to  use  a reflection  argument.  H 

Hence,  while  the  extendibility  of  k may  imply  the  existence  of  various  large 
cardinals  above  at,  13  is  a large  cardinal  property  of  k — crit(j)  that  implies  the 
very  same  large  cardinal  property  for  a cardinal  above  k. 

Attitudes  about  and  expectations  concerning  11-13  have  evolved  since  their 
formulation,  from  skepticism  toward  confidence  and  acceptance.  In  Solovay- 
Reinhardt-Kanamori  [78:  109]  we  had  written:  “It  seems  likely  that  11,  12,  and 
13  are  all  inconsistent  since  they  appear  to  differ  from  the  proposition  proved 
inconsistent  by  Kunen  only  in  an  inessential  technical  way.”  However,  this  may 
not  be  the  case.  As  mentioned  earlier,  the  Axiom  of  Choice  figures  prominently 
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in  the  proof  of  Kunen’s  result  by  providing  a well-ordering  of  V(k).  and  such 
well-orderings  first  appear,  in  coded  form,  in  Vx+i  — Vx+i-  Hence,  any  refutations 
of  11-13  would  have  to  have  a different  basis.  Also,  recent  work  has  provided 
some  extrinsic  evidence  for  their  coherence  and  consistency: 

Determinacy  hypotheses  are  discussed  in  Chapter  6.  In  1978  Martin  [80] 
showed  (see  31.8  and  after)  that  the  hypothesis  Definj)  follows  from  a proposition 
intermediate  between  12  and  13,  and  in  1984  Woodin  showed  (see  31.9)  that  ADL(R) 
follows  from  a proposition  stronger  than  II  and  straining  the  limits  of  consistency: 

10.  For  some  8,  there  is  a j:  L(Vs+ 1)  -<  L(V$+ i)  with  crit(j')  < 8 . 

(This  last  condition  ensures  that  j\V$+i  witnesses  II.)  The  proofs  of 23. 12  preclude 
the  possibility  here  of  L(Vs+ 1)  1=  AC.  These  Martin  and  Woodin  results,  discussed 
in  context  in  §31,  provided  the  first  indications  that  large  cardinal  hypotheses 
suffice  to  establish  strong  determinacy  hypotheses,  and  further  progress  was  to 
lead  to  remarkable  equiconsistency  results.  That  propositions  at  the  level  of  10-13 
could  serve  as  appropriate  hypotheses  for  the  initial  implementation  of  the  proof 
ideas  is  some  extrinsic  evidence  in  their  favor.  Moreover,  Woodin’ s investigations 
of  10  was  to  produce  a detailed  and  coherent  structural  theory  for  L(V$+\).  Laver 
[01,  02]  summarizes  and  develops  this  theory  further. 

Work  concerning  13  has  led  to  characterizations  of  corresponding  algebras 
of  elementary  embeddings.  Martin  had  used  the  proliferation  of  elementary  em- 
beddings given  by  24.5  and  composition  to  establish  his  determinacy  result,  and 
questions  soon  arose  about  the  supporting  algebraic  structure.  After  exploratory 
work  by  Laver  [86],  Dehomoy  [88, 89, 89a,  89b],  and  Randall  Dougherty,  Laver 
[92]  in  1988  established  the  freeness  of  the  algebras  subject  only  to  the  obvious 
identities. 

Set  £$  = {j  | j:  Vs  -<  Va}.  In  view  of  24.5,  define  a binary  operation  on 
£s  by:  j ■ k — j+(k).  Another  operation  on  £g  is  composition;  if  j,  k e £s,  then 
j o k e £s.  The  following  equalities  are  simple  to  check;  such  rules  were  first 
applied  by  Martin  [80]. 

24.6  Exercise.  Suppose  that  i,  j , k e £s . Then: 

(a)  i o ( j o k)  = (i  o j ) o k,  ( i o j ) -k-i  ■ (j  ■ k),  i ■ ( j ok)  = (i  ■ j)  o ( i ■ k), 

and  i o j = ( i ■ j)  o i. 

(h)  i ■ (j  • k)  = (i  ■ j)  ■ (;'  • k). 

Hint,  (b)  also  follows  from  (a).  -I 

Let  H denote  the  set  of  laws  in  (a);  (b)  is  the  left  distributive  law  for  ■.  Laver 
established  that  these  are  the  only  laws: 

24.7  Theorem  (Laver  [92]).  Suppose  that  j e £s . Then: 

(a)  The  closure  of  O']  in  ( £s , • } is  the  free  algebra  with  one  generator  satis- 
fying the  left  distributive  law. 
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(b)  The  closure  of  { j } in  (£g,  • , o}  is  the  free  algebra  with  one  generator 
satisfying  S.  H 

Freeness  here  has  the  standard  meaning.  For  (b),  let  W be  the  set  of  terms  in 
one  constant  a in  the  language  of  • and  o . Define  an  equivalence  relation  = on 
W by  stipulating  that  u = v iff  there  is  a sequence  u — uq,u\,  . . . ,un  — v with 
each  w,+i  obtained  from  u,  by  replacing  a subterm  of  w,  by  a term  equivalent  to  it 
according  to  one  of  the  laws  of  S.  Then  • and  o are  well-defined  for  equivalence 
classes,  and  (b)  asserts  that  the  resulting  structure  on  W /=  and  the  closure  of  {7} 
in  (£s,  • , o)  are  isomorphic  via  the  map  induced  by  sending  the  equivalence  class 
of  a to  j . 

For  u,  v e W,  define  u </,  v iff  for  some  uq, . . . , uin+\  e W, 

v = ((...  (, u ■ uq)  ■ w2)  ■ . ■ ■ w„)  ■ wn+i  , or 

v = ((. . . (M  • Wi)  ■ w2) . . . • wn)  o Wn+i  . 

Laver  [92]  showed  that  if  Eg  / 0 for  some  S,  then  <L  is  irreflexive,  and  used 
this  result  to  obtain  a normal  form  theorem  for  W/=  showing  that  <L  actually 

linearly  orders  W / =,  from  which  24.7  as  well  as  the  solvability  of  the  word 

problem  for  W / = follows  (this  is  the  problem  of  effectively  deciding  whether 
u = v or  not  for  arbitrary  u,v  e W).  Considerable  interest  was  generated  by  a 
hypothesis  bordering  on  the  limits  of  consistency  entailing  solvability  in  finitary 
mathematics,  particularly  because  of  the  peculiar  and  enticing  possibility  that  some 
strong  hypothesis  may  be  necessary.  Dehornoy  [92]  provided  another  approach 
to  the  freeness  and  solvability  conclusions  from  the  irreflexivity  of  <l-  A few 
years  later  in  late  1991,  he  [92a,  94]  settled  matters  by  establishing  that  irreflexivity 
outright  in  ZFC  with  an  elegant  argument  based  on  an  embedding  into  an  extension 
of  the  infinite  braid  group.  Consequently,  all  the  various  structure  results  obtained 
about  W /=,  including  the  normal  form  results  of  Laver  [92, 94],  are  now  theorems 
of  ZFC.  This  is  yet  another  testament  to  the  fecundity  of  large  cardinals:  It 
suggested  a simple  algebraic  structure  and  inspired  its  further  study  in  connection 
with  elementary  embeddings  and  critical  points.  Much  of  the  structure  understood, 
the  focus  became  the  question  of  the  irreflexivity  of  a certain  order,  and  the  large 
cardinals  were  gracefully  retired  from  the  field.  See  the  monograph  Dehornoy 
[00]  for  developments  in  braid  groups  and  distributivity. 

The  study  of  (£$,  ■)  has  raised  other  interesting  finitary  issues.  For  j e £&, 
let  k = crit(j')  and  let  Aj  be  the  closure  of  { j}  in  {£$,  ■).  Define  a corresponding 
function  / on  co  by: 

f{ii)  = |{crit(fc)  | k eAj  A jn(K)  < cnt(k)  < j"+1(ic)}\  . 

Laver  had  noted  that  /( 0)  = 0,  /( 1)  = 0,  and  /( 2)  = 1,  but  that  /( 3)  is  large. 
Dougherty  [93,  96]  established  a large  lower  bound  for  /( 3)  and  showed  moreover 
that  / eventually  dominates  the  Ackermann  function,  and  hence  cannot  be  primi- 
tive recursive.  It  was  not  even  clear  that  each  / (n ) is  finite,  but  using  a result  of 
Steel,  Laver  [95]  duly  established  this,  so  that  the  ordertype  of  [crit(L)  | k e Aj)  is 


24.  The  Strongest  Hypotheses 


331 


a>.  The  argument  involved  a tower  of  finite  algebras  which  may  be  defined  without 
reference  to  large  cardinals.  Laver  showed  that  a simple  finitistic  proposition  0 
about  the  tower  is  a consequence  of  13,  and  at  present  it  is  not  known  whether 
0 is  provable  in  ZFC.  Using  Dougherty  [93],  Dougherty  and  Jech  in  their  [97] 
showed  that  0 cannot  be  established  in  Primitive  Recursive  Arithmetic. 

Of  course,  the  extent  of  structure  at  the  level  of  10-13  may  be  due  to  their 
inconsistency!  However,  just  as  in  the  case  of  Silver’s  0#  investigations  for  which 
one  motivation  was  surely  the  prospect  of  a possible  refutation  of  measurability, 
the  further  elaboration  of  coherent  structure  at  the  level  of  10-13,  despite  the 
stalking  of  possible  new  inconsistencies,  may  reinforce  confidence  in  these  strong 
hypotheses. 

n -Hugeness 

The  remainder  of  this  section  is  devoted  to  the  main  hypotheses  between  ex- 
tendibility  and  10-13,  as  charted  out  in  Solovay-Reinhardt-Kanamori  [78].  By 
Kunen’s  result,  if  j'.V^cM  with  crit(y)  = at,  then  M is  not  closed  under  se- 
quences of  length  X — sup([j''(k)  | n e a>}).  It  is  natural  to  step  back  and  consider 
weaker  closure  conditions.  For  n e co, 

k is  n-huge  iff  there  is  a j:  V -<  M with  crit(/)  = k 
and  c M . 

k is  huge  iff  k is  1-huge  . 

k is  almost  huge  iff  there  is  a j:  V < M with  crit(j)  = k 
and  Y M c M for  every  y < j (k)  . 

Note  that  k is  0-huge  iff  k is  measurable',  that  if  j:  V -<  M witnesses  the  n- 
hugeness  of  k,  then  j"(K ) is  measurable  (in  M and  hence  in  V as  — 

V(jn (k)));  and  similarly,  if  j:  V < M witnesses  the  almost  hugeness  of  k,  then 
j (/c ) is  inaccessible  in  V.  We  shall  see  that  the  existence  of  an  almost  huge 
cardinal  is  consistency-wise  much  stronger  than  the  existence  of  an  extendible 
cardinal,  n-huge  cardinals  are  reminiscent  of  y -supercompact  cardinals  and  will 
have  a similar  characterization  in  terms  of  ultrapowers  that  provides  a formalization 
in  ZFC.  However,  there  is  a crucial  difference:  y -supercompactness  is  formulated 
in  terms  of  an  a priori  y as  the  degree  of  closure  of  M (and  requires  y <j  (*)), 
but  n -hugeness  posits  closure  only  a posteriori  in  that  M is  to  be  closed  under 
j"(K)  sequences,  however  large  jn(K)  may  be.  Even  then,  an  attempt  has  been 
made  to  motivate  n -hugeness  as  a reflection  property  by  Victoria  Marshall  [89]. 

Hugeness  was  formulated  by  Kunen  in  1972  in  order  to  have  a sufficient 
hypothesis  for  a relative  consistency  result.  As  mentioned  at  the  end  of  § 1 7,  Kunen 
[78]  established  that  if  there  is  a huge  cardinal,  then  there  is  a forcing  extension 
in  which  there  is  an  ^-saturated  ideal  over  co\.  The  conclusion  was  known  to 
have  consistency  strength  stronger  than  the  existence  of  a measurable  cardinal, 
and  this  was  the  first  time  that  a hypothesis  stronger  than  measurability  was  used 
in  a relative  consistency  result  about  the  low  orders  of  the  cumulative  hierarchy. 
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Since  then,  supercompactness  and,  to  some  extent,  hugeness  have  played  the  major 
such  roles.  Most  arguments  have  depended  on  the  reflection  phenomena  provided 
by  supercompactness,  but  on  occasion  the  relative  consistency  of  two-cardinal 
propositions  has  made  use  of  the  interplay  among  k,  j(i<)  and  j(K)M  c M. 

Refinements  have  followed  once  upper  bounds  on  consistency  strength  have 
been  established.  For  example,  almost  hugeness  became  more  focal  when  it  was 
realized  that  it  could  replace  hugeness  in  Kunen’s  argument.  Then,  as  part  of  an 
important  development,  Foreman-Magidor-Shelah  [88]  further  reduced  the  hypoth- 
esis in  Kunen’s  result  to  supercompactness,  and  then  Shelah  reduced  the  hypothesis 
to  an  essentially  equiconsistent  one  (see  32.10  and  before).  Almost  hugeness  is 
still  needed,  however,  to  get  an  o>3 -saturated  ideal  over  o>2  ■ While  it  is  unlikely 
that  equiconsistency  results  for  supercompactness  and  hugeness  lie  in  this  direc- 
tion of  combinatorial  propositions,  these  hypotheses  have  considerably  clarified 
their  consistency  strength. 

The  following  characterization  is  implicit  in  the  proof  of  24.2.  Adapting  a 
previous  definition,  for  a filter  F over  some  V(X)  (i.e.  F c V(P(X)), 

F is  normal  iff  (i)  for  any  a < X,  {x  G V(X)  \ a G x}  G F , and 
(ii)  for  any  (Xa  \ a < X)  e XF,  Aa<xXa  e F . 

24.8  Theorem.  k is  n-huge  iff  k > to  and  there  is  a K-complete  normal  ultrafilter 
U over  some  V(X)  and  cardinals  k — Xq  < X\  <...<  X„  — X so  that  for  each 
i < n, 

{x  G FIX')  | ot(x  ft  = A./}  G U . 

Proof.  Suppose  first  that  j : V ~<  M witnesses  the  n -hugeness  of  k.  Set  Xj  — j'  (k) 
for  i <n,  and  define  U over  V(Xn)  by: 

XeU  iff  X c V(Xn)  A j“Xn  g j(X)  . 

Then  it  is  simple  to  check  that  U is  a a: -complete  normal  ultrafilter.  Now  note 
that  for  i < n,  ot (j“Xn  Pi  j(Xl+\))  = j(X,). 

For  the  converse,  take  j:  V ^ M = Ult(y,  U).  Then  [id]c/  = j“X  where 
id:  V(X)  ->  V(X)  is  the  identity  map,  and  so  XM  C M.  Also,  for  i < n 

•^-i+l  = °t( j“X  Pi  j(Xi+ 1)) 

= [(ot(x  Pi  Xi+fi  | x G V(X))]u 
— [(^-i  I X G V(X))\i j 

= j(h)  ■ 

Finally,  the  case  i — 0 and  /r-completeness  implies  that  crit(y')  — k.  H 

This  leads  to  the  expected  transcendence  results  by  the  usual  reflection  argu- 
ment: 

24.9  Exercise. 

(a)  If  there  is  a j.  Vg  < Vg  as  in  13  with  crit  (j)  — k,  then  there  is  a normal 
ultrafilter  U over  k such  that 
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{a  < k | a is  n-huge  for  every  n]  e U . 

(b)  If  k is  (n  + l)-huge,  then  there  is  a normal  ultrafilter  U over  k such  that 

{a  < k | a is  n-huge}  e U . H 

24.8  also  shows  that  fi  is  n-huge1  is  UfF  by  an  argument  (before  22.9)  for 
y -supercompactness.  This  leads  to  the  following  observation. 

24.10  Exercise  (Morgenstem  [79]).  If  k is  supercompact  and  there  is  an  n-huge 
cardinal  above  k,  then  there  are  k n-huge  cardinals  below  k.  In  particular,  if  there 
are  supercompact  and  n-huge  cardinals,  then  the  least  n-huge  cardinal  is  less  than 
the  least  supercompact  cardinal. 

Hint.  For  any  a < k,  consider  the  iTp  proposition  r there  is  an  n-huge  cardinal 
above  a1  and  apply  22.3.  H 

It  will  soon  be  shown  that  the  existence  of  a huge  cardinal  implies  the  con- 
sistency of  the  existence  of  supercompact  cardinals  in  a strong  sense.  24.10  is  a 
consequence  of  supercompactness  as  a global  reflection  property  and  is  unrelated 
to  the  consistency  strength  of  hugeness. 

Almost  hugeness  has  a characterization  in  terms  of  sequences  of  ultrafilters. 
Suppose  that  k < X,  and  for  k < y < X,  Uy  is  a normal  ultrafilter  over  VKy. 
Then  (Uy  \ k < y < X)  is  a coherent  sequence  iff  whenever  k < y < S < X, 
U$\y  — Uy.  This  refers  to  the  projection  process  described  after  22.12;  setting 
jY\  V < My  = Ult(V,  Uy)  there  are  corresponding  embeddings  kyg:  My  -<  Ms 
such  that  js  = kys  ° jY,  given  by 

kys([(f  O)  I v e VKy)]ur)  = [{.fix  fly)  \ x e VKS)]Us  . 

24.11  Theorem.  k is  almost  huge  iff  there  are  an  inaccessible  X > k and  normal 
ultrafilters  Uy  over  VKy  for  k < y < X such  that  {UY  \ k < y < X)  is  coherent, 
and  the  corresponding  jy 's  and  kys ’s  satisfy:  if  k < y < X and  y < a < jy{ic), 
then  there  is  a S such  that  y < 8 < X and  kYs(u ) = 8. 

Proof.  For  the  forward  direction,  let  j:  V < M witness  the  almost  hugeness  of  k. 
Set  X = 7 (at),  so  that  X is  inaccessible.  For  k <y  <X,  define  UY  by: 

XeUy  iff  X c VKy  a f‘y  ej(X)  . 

As  readily  seen,  each  Uy  is  a normal  ultrafilter  over  VKy  and  {Uy  \ k < y < X) 
is  a coherent  sequence. 

Suppose  next  that  k < y < X and  /:  VKy  ->  k is  such  that  y < [f]u  < 
jy {k ) . Noting  that  y = otffid]^)  where  id:  VKy  — > VKy  is  the  identity,  the 
definition  of  UY  implies  that  y — ot (j“y)  < j(f)ij“y)  < j (k)  = X.  Set 
8 = ;(/)(/»•  Then  ot(y“<5)  = 8 = j(f)(j“8  n j(y)),  and  so 

[x  e Vk8  | ot(.r)  = f(x  fi  /)}  e Us  . 

Hence,  kys([f]uv)  — 8 by  the  definition  of  kys. 
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For  the  converse,  note  that 

{{My  | k <y  < X),  ( kyS  \ k < y < 8 < X)) 

is  a directed  system,  so  let  M be  its  direct  limit.  M is  well-founded  since  X is 
inaccessible,  so  it  can  be  assumed  that  M is  an  inner  model.  There  are  corre- 
sponding j : V -<  M and  ky : My  -<  M for  k < r < x such  that  j — kY  o jy.  Note 
for  what  follows  that  if  a < y < 8 < X,  then  kys(a)  = a (22.12(b)),  and  thus 

ky(a)  — a. 

To  complete  the  proof,  we  show  that  ^ M c M for  every  /J  < X and  that 
j(K)  = X.  If  f < X and  s — (xa  \ a < yd)  e ^ M,  then  by  the  regularity  of  X there 
are  y with  yd  < y < X and  {ya  \ a < ft}  c My  such  that  kY(ya ) = xa  for  a < f$. 
t = {y a | a < /l)  e My  since  7 My  c My,  and  noting  that  ky  (ft)  — yS,  it  follows 
that  kY(t)  — s e M. 

To  show  that  j(ic)  — X , note  first  that  if  y < K then  since  y < jr(K), 
y < kY{y)  < ky(jy(K))  = j(K ) . 

Hence,  j(i<)  > X.  For  the  other  direction,  suppose  that  p < j{K).  There  is  a y 
and  an  a < jy{K ) such  that  ky{a)  — ft.  If  a < y,  then  yd  = ky(a)  — a < jy{K). 
If  a > y,  then  by  hypothesis  there  is  a 8 such  that  y < 8 < X and  ky$(a)  = 8\ 
but  k$(8)  — 8,  and  so  it  follows  that  yd  = k&(8)  — 8 < js(i<)-  In  either  case,  there 
is  some  f < X such  that  yd  < (k),  and  so  by  22.11(c) 

p<j({K)  < {2^‘)+  <X 

as  X is  inaccessible.  Hence,  j (k)  — X.  H 

24.12  Exercise.  If  k is  huge,  then  there  is  a normal  ultrafilter  U over  k such  that 

{a  < k | a is  almost  huge}  e U . H 

Beyond  these  results  from  Solovay-Reinhardt-Kanamori  [78],  the  theory  of 
huge  cardinals  has  been  pursued  in  several  papers.  Robert  Mignone  [84]  and 
Carlos  Di  Prisco  and  Wiktor  Marek  in  their  [84, 85]  analyze  the  ultrafilter  char- 
acterization, and  It  Beng  Tan  [81],  Barbanel-Di  Prisco-Tan  [84],  Di  Prisco-Henle 
[85],  Abe  [86],  Di  Prisco-Marek  [88],  and  Barbanel  [89,91,91a]  consider  vari- 
ous generalizations  and  elaborations.  The  following  is  a global  generalization  of 
hugeness  in  the  spirit  of  supercompactness  and  extendibility: 

k is  superhuge  iff  for  any  y there  is  a j:  V -<  M with 

crit(j)  = at,  i<K> M C M,  and  y <J(K)  ■ 

The  following  shows  how  this  concept  fits  into  the  overall  scheme: 

24.13  Theorem  (Barbanel-Di  Prisco-Tan  [84]). 

(a)  If  k is  superhuge,  then  k is  extendible  and  there  is  a normal  ultrafilter  U 
over  k such  that 


{a  < k | a is  extendible]  e U . 
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(b)  If  k is  2 -huge,  then  there  is  a normal  ultrafilter  U over  k such  that 
{a  < k | VK  |=  a is  superhuge}  e U . 

Proof,  (a)  First,  it  is  simple  to  produce  normal  ultrafilters  over  VKy  for  every 
Y > k to  show  that  k is  supercompact.  With  r]  arbitrary,  we  next  show  that  k 
is  ?;-extendible:  Let  j:  V -<  M witness  the  hugeness  of  k with  rj  < j(t<).  Set 
e = j\VK+tl;  since  j(ic)  is  inaccessible,  e e M and  so  M |=  k is  ?;-extendible.  But 
by  elementarily  M \=  j(K)  is  supercompact,  and  so  by  the  argument  for  23.9(a), 
M h (Vm  \=  k is  ?; -extendible).  Finally,  V^K)  = VpK),  and  so  k really  is 
?7-extendible. 

To  complete  the  proof  of  (a),  note  that  the  above  argument  shows  that 
if  j:  V -<  M is  any  witness  to  the  hugeness  of  k,  then  M \=  (Vj(k)  1= 
k is  extendible).  Flence,  if  U is  the  normal  ultrafilter  over  k defined  from  j, 
then  A = {a  < k \ VK  [=  a is  extendible}  e U.  But  k is  extendible,  so  by  23.11 
every  a e A is  extendible. 

(b)  Temporarily  denote  by  a — > (fi)  the  following:  there  is  a j\  V < M 
witnessing  the  hugeness  of  a with  j (a)  = f>.  By  24.8,  this  amounts  to  asserting 
the  existence  of  a certain  ultrafilter  over  V(f).  Let  j:  V < M witness  the  2- 
hugeness  of  k.  Then  M |=  k — > (j(K))  since  7 C M,  and  so  if  U is  the 
normal  ultrafilter  over  k defined  from  j,  then  A — {a  < k |a— » (v)}  e U. 
For  any  a e A,  M |=  a —>  (at),  and  so  {§  < k \ a ->  (f )}  e U and  hence  is 
unbounded  in  k.  Consequently,  A = {a  < k \ VK  |=  a is  superhuge}.  H 

Vopenka’s  Principle 

Finally,  a circle  is  completed  by  going  back  to  a model-theoretic  hypothesis  that 
was  first  considered  about  the  time  that  extendible  cardinals  were  formulated.  This 
is  Vopenka ’s  Principle: 

For  any  proper  class  of  structures  for  the  same  language, 

there  is  one  that  is  elementarily  embeddable  into  another  . 

This  hypothesis  has  an  immediate  appeal  owing  to  its  unencumbered  statement  and 
its  readily  discernible  motivation  in  the  set  vs.  class  distinction.  Given  a proper 
class  {AAa  I 01  e On}  of  structures  for  the  same  language,  there  must  be  two,  in 
fact  a proper  subclass,  that  have  the  same  theory.  Vopenka’s  Principle  can  be  mo- 
tivated in  terms  of  a further  resemblance  based  on  the  elements  of  the  structures: 
Considerations  of  the  richness  of  proper  classes  and  the  uniformity  enforced  by  the 
class  definition  suggest  that  in  cases  like  Ma  — {Va,  e)  one  structure  ought  to  be 
an  elementary  substructure  of  another.  On  the  other  hand,  if  Ma  — (VYa,  £,  {a}) 
where  ya  > a,  Ma  < Mp  fails  for  a < ft  because  of  the  singleton  predicates. 
Flowever,  this  can  be  rectified  by  positing  an  association  up  to  a renaming  of  the 
elements,  i.e.  an  elementary  embedding  j:  Ma  -<  Mp,  which  must  necessarily 
satisfy  j(a ) = fi.  This  step  from  elementary  substructure  to  elementary  embed- 
dability,  necessary  to  state  a general  resemblance  principle  involving  elements,  is 
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a dramatic  strengthening.  Several  people  quickly  saw  what  was  suggested,  that 
Vopenka’s  Principle  implies  the  existence  of  extendible  cardinals. 

Vopenka’s  Principle  cannot  be  formulated  in  ZFC.  Its  consequences  were 
pursued  in  class  theory  in  Solovay-Reinhardt-Kanamori  [78],  but  this  work  will 
be  cast  here  in  terms  of  inaccessible  k so  that  VK  |=  Vopenka’s  Principle  with 
arbitrary  rather  than  definable  “classes”  X C VK,  as  in  Kanamori  [78].  To  focus 
the  discussion,  a sequence  of  structures  (Ma  \ a < k)  is  natural  iff  each  Ma 
has  form  ( Vf<a),  e,  ]a],  Ra)  where  Ra  c Vf  (ff)  and  a < f < k implies  that 
a < /( a)  < f(fi)  < k.  Through  coding  we  can  construe  as  natural  those 
sequences  where  Ra  is  replaced  by  a finite  number  of  relations.  The  specification 
of  ]a]  in  Ma  ensures  that  whenever  a < f>  and  j:  Ma  -<  Mp,  j must  move 
some  ordinal  since  j (a)  = f. 

In  the  spirit  of  the  study  of  indescribability  in  terms  of  corresponding  filters 
(§6),  define  for  inaccessible  k the  contextual  notion  of  non-negligible  subset  of  k: 
For  X C k, 

X is  Vopenka  in  k iff  for  any  natural  sequence  ( Ma  | a < k) 
there  is  a j : Ma  < Mp  for  some 
a < f < k with  critical  point  in  X , 

and 

k is  Vopenka  iff  k is  Vopenka  in  k . 

This  leads  to  the  consideration  of  the  collection  of  the  all  but  negligible  subsets, 

F — [X  C.  k | /c  — X is  not  Vopenka  in  k}  . 

Note  that  X e F iff  there  is  a natural  sequence  ( Ma  \ a < k)  such  that  for  any 
a < ft  < k and  j:  Ma  < Mp,  its  critical  point  is  in  X.  Also,  if  X,  Y e F,  then 
VnT  e F by  pointwise  amalgamating  the  two  corresponding  natural  sequences 
into  one.  Finally,  if  ( Aa  \ a < k)  e K VK  is  any  sequence  of  structures  for  the  same 
language,  then  each  Aa  can  be  encoded  into  an  Ma  so  that  (Ma  \ a < k)  a natural 
sequence,  and  so  if  X is  Vopenka  in  k,  there  ar e a < f < k and  j:  Aa  -<  Ap 
such  that  if  it  has  a critical  point,  then  it  belongs  to  X.  These  remarks  show  that 

F is  a (proper)  filter  iff  VK  |=  Vopenka’s  Principle  iff  k is  Vopenka  . 

F is  the  Vopenka  filter  over  k,  and  that  it  is  indeed  a (proper)  filter  will  be  implicit 
in  the  use  of  this  terminology.  Although  in  terms  of  §6,  k being  Vopenka  is  a n\ 
property  of  VK  and  hence  does  not  even  imply  the  weak  compactness  of  k,  the 
following  propositions  provide  structural  information  which  shows  that  it  implies 
the  existence  of  extendible  cardinals  in  VK  in  a strong  sense. 

24.14  Proposition.  The  Vopenka  filter  over  k is  normal. 

Proof  Suppose  that  X is  Vopenka  in  k and  f:  X k is  regressive.  Assume  to 
the  contrary  that  for  each  y < k,  /_1({yl)  is  not  Vopenka  in  k,  i.e.  there  is  a 
natural  sequence  (Ma  | a < k)  such  that  for  any  elementary  embedding  of  one 
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structure  into  another  with  critical  point  p,  f(p)  fi  y . Define  a natural  sequence 
(J\fa  | a < k)  by  setting 

Afa  = {Vg(a)+a >,  e,  {«},  {Ml  | y < a),  f\a) 

where  Vg(a]  is  the  union  of  the  domains  of  Ml  for  y < a.  Since  X is  Vopenka, 
there  is  a j:  Ma  -<  Afp  with  a critical  point  p e X.  But  then  /(p)  < ?;  implies  that 
j(ftn))  — f{rf)  so  that  j\Mfa(,})\  M{,(r,)  < Mfp'])  with  critical  point  p,  which  is 
a contradiction.  H 

This  result  is  the  n — 1 case  of  a larger  scheme  in  Kanamori  [78],  which 
analyzes  hypotheses  cascading  up  alongside  the  n-huge  cardinals. 

24.15  Proposition.  {§  < k \ VK  \=  £ is  extendible } is  in  the  Vopenka  filter  over  k. 
Proof.  Define  g e kk  by: 

' § if  VK  fi  | is  extendible  , else 

8(%)  = | where  ?;  is  the  least  ordinal  such 

that  VK  |=  § is  not  ^-extendible  . 

SetC  = {p  < k | g\p:  p}.  Then  C is  closed  unbounded,  so  that  C is  in  the 

Vopenka  filter  over  k by  24.14.  So,  let  ( Ma  \ a < k)  be  a natural  sequence  such 
that  for  any  elementary  embedding  of  one  structure  into  another  with  a critical 
point  p,  p e C.  Now  define  a natural  sequence  (Afa  \ a < k)  by  setting 

Afa  = <Vy„,  e,  [a],  Ma,  C n ya) 

where  ya  is  the  least  limit  point  of  C greater  than  every  ordinal  in  the  domain  of 
Ma . To  complete  the  proof,  it  suffices  to  show  that  if  j : Afa  < Afp  with  a critical 
point  f , then  f is  extendible: 

Assume  to  the  contrary  that  this  fails,  so  that  g(f)  > §.  Since  £ < ya  and 
ya  e C,  g(§)  < ya.  It  follows  that  j\Vg^y.  Vg -<  V/(g(|))  with  critical  point 
Also,  note  that  f e C as  Ma  is  encoded  in  J\fa.  Hence,  j(fi)  e C as  C fi  ya  is 
encoded  in  Afa.  It  then  follows  from  % < j(f)  that  g(f)  < jifi).  But  then,  if 
g(^)  — § + p,  these  properties  show  that  VK  fi  § is  ?;-extendible,  contradicting 
the  definition  of  g(£).  H 

The  following  proposition  shows  that  the  Vopenka  filter  is  closed  under  a 
strong  analogue  of  Mahlo’s  operation. 

24.16  Proposition.  If  X is  in  the  Vopenka  filter  over  k,  then  so  also  is  the  set 
{§  < k | £ is  measurable  and  there  is  a normal  ultrafilter  over  f containing 

xnf}. 

Proof.  Define  a natural  sequence  {Ma  \ a < k)  by  setting 


Ma  = {Va+lo,  e,  [a],  {X  fi  a}}  . 
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It  suffices  to  show  that  if  j:  Ma  < Mp  with  a critical  point  f e X,  then  there  is  a 
normal  ultrafilter  over  £ containing  X flf . But  if  U is  the  usual  normal  ultrafilter 
over  | defined  from  j,  then 

j{x n f)  = j(X n « n £)  = j(x  n a)  n j($)  = (xn p)  n j ($)  = xn ;(?) 

and  $ e X n ) so  that  X n £ e U.  H 

24.17  Corollary.  If  k is  Vopenka,  then  k is  K-Mahlo  and  the  set 

{f  < k | VK  N ? is  extendible  and  there  is  a normal 

ultrafilter  over  § containing  {£  < § | VK  |=  ^ is  extendible}} 

is  stationary  in  k. 

Proof.  By  24.14,  24.15,  and  24.16.  H 

William  Powell  [72]  and  others  situated  Vopenka’s  Principle  into  the  large 
cardinal  hierarchy  below  almost  hugeness.  Typically,  one  can  establish  much  more 
than  is  needed: 

24.18  Theorem.  Suppose  that  k is  almost  huge.  Then  there  is  a normal  ultrafilter 
U over  k such  that: 

(a)  For  any  natural  sequence  {A4a  \ a < k)  there  is  a Y e U such  that  a < /3 
both  in  Y implies  that  there  is  an  elementary  embedding:  A4a  < M.p  with  critical 
point  a. 

(b)  {a  < k | a is  Vopenka}  e U. 

Proof.  Let  j:  V < M witness  the  almost  hugeness  of  k and  U the  normal  ultrafilter 
defined  from  j . To  establish  (a),  let  (Ma  \ a < k)  be  a natural  sequence.  Setting 
Xa  = {f  < k | there  is  an  elementary  embedding:  Ma  -<  with  critical  point 
a}  for  a < k and  T = {a  < k \ Xa  e U},  we  shall  show  that  T e U.  Granted 
this,  the  normality  of  U would  imply  that 

Y = {a  e T \ a e fl^n e U , 

and  clearly  this  Y would  be  as  desired. 

To  show  that  T e U,  set 

j((Ma  | a <k))  = (M*  | a < , 

j((M*a  | a < ./(*)»  = (M*f  | a < J2{k))  , and 
j({Xa  | a < k))  = (X*  | a < j(K))  . 

For  any  a < k,  noting  that  M*  — Ma  we  have:  Xa  e U iff  k e j(Xa)  iff  there 
is  an  elementary  embedding:  Ma  -<  M*  with  critical  point  a.  So  if  a < j (k),  by 
elementarily  we  have:  X*  e j(U)  iff  in  M,  there  is  an  elementary  embedding: 
M*a  -<  M**k)  with  critical  point  a.  Hence,  T e U iff  k e j{T)  iff  X*  e j(U ) iff 
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in  M,  there  is  an  elementary  embedding:  M*  < with  critical  point  k.  But 

setting  J - j\M*,  J:  M*  < M**k).  Also,  j is  a subset  of  M with  cardinality 
that  of  the  domain  of  M*,  which  in  turn  is  less  than  j (7c).  Hence,  j e M,  and  so 
T eU. 

For  (b),  just  note  that  by  (a),  M \=  k is  Vopenka.  H 

Finally,  characterizations  of  Vopenka’s  Principle  are  drawn  from  Solovay- 
Reinhardt-Kanamori  [78]  in  terms  of  “boldface”  versions  of  extendibility  and 
supercompactness.  Characterizations  in  terms  of  model-theoretic  logics  were  pro- 
vided by  Edward  Fisher  [77]  and  Johann  Makowsky  [85].  The  following  defini- 
tions serve  present  purposes:  For  any  set  A and  k < q, 
k is  q-extendible  for  A iff  there  is  a £ and  a 

j:  <v„,e,A  n v„>  -<  (vE,eAnyf) 

with  crit(j ) = k and  q <j(K)  , 
k is  q-supercompact  for  A iff  there  is  an  a < k and  an 

e\  (Va,  e,  A fi  Va)  < {Vn,  e,  A n Vn) 
with  e(8)  — k,  where  S — crit(e)  . 

The  second  definition  is  based  on  the  characterization  22.10.  Injecting  A’s  into 
previous  arguments  lead  to  characterizations: 

24.19  Exercise.  The  following  are  equivalent  for  inaccessible  k: 

(a)  k is  Vopenka. 

(b)  For  any  A C VK  there  is  an  a < k q- extendible  for  A for  every  q < k. 

(c)  For  any  A C VK  there  is  an  a < k q-supercompact  for  A for  every  q < k. 

Hint.  The  argument  for  24.15  adapts  to  establish  (a)  ->  (b). 

For  (b)  ->  (c),  show  that  for  any  Ac  VK,  if  a is  ^-extendible  for  A for  every 
q < k,  then  a is  p-supercompact  for  A for  every  q < k\  For  such  an  a let 

j:  <y„+a),  e,  A n V„+£U>  -<  (Vf , e,  A n V() 
with  crit(y')  = a and  q < j(a).  Set  j = j\Vn.  Then  j e Vj,  and  noting  that 

A n v,  = (A  n Vm)  n v„  = j (A  n v„)  n vn, 

J:  <V„  e,  j(A  n Vf)  n Vf  < {Vm,  e,  j(A  n Vf)  . 

Now  apply  the  elementarily  of  j (cf.  22.10). 

Arguing  with  natural  sequences  as  A’s  establishes  (c)  -*  (a).  H 

The  strong  hypotheses  from  Reinhardt’s  extendibility  to  Kunen’s  inconsis- 
tency were  formulated  and  systematized  into  a linear  hierarchy  in  a relatively 
short  time.  Since  then,  this  edifice  has  proved  resilient,  reinforced  by  relative 
consistency  results  from  hugeness  and  even  10-13,  and  the  further  elucidation  of 
structure  at  these  hypotheses.  Of  course,  a new  inconsistency  result  would  be  an 
exciting  development,  but  as  time  goes  by,  the  further  analysis  and  application  of 
these  hypotheses  suggest  that  they  may  be  approached  with  increasing  confidence 
if  not  acceptance. 
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25.  Combinatorics  of  VKy 

The  emergence  of  VK  y in  the  study  of  supercompactness  led  to  several  combina- 
torial problems  about  this  set  that  stimulated  considerable  research;  this  section  is 
devoted  to  the  work  through  the  1980’s  in  this  offshoot  direction.  For  convenience, 

k denotes  a regular  uncountable  cardinal  and  k < y in  this  section  . 

The  seminal  paper  in  this  area  was  Jech  [73],  which  in  the  spirit  of  the 
generalization  from  measurability  to  y -supercompactness  advanced  the  idea  of 
generalizing  combinatorial  properties  of  and  problems  about  ( k , <)  to  (' PKy,  C>. 
Basic  to  the  subsequent  work  was  the  Jech  [71]  generalization  of  the  concepts  of 
closed  unbounded  set  and  stationary  set:  For  X 

X is  unbounded  in  VKy  iff  for  any  y eVKy  there  is 

an  x 3 y such  that  x e X . 

X is  closed  in  VKy  iff  for  any  [xj  | | < f}  C X 

with  p < k and  x%  c xf  for 
£ < f < 0.  U s<fixH  e X . 

X is  closed  unbounded  in  VKy  iff  X is  closed  in  VKy 

and  unbounded  in  VKy  . 

X is  stationary  in  VKy  iff  X fl  C 0 for  any  C 

closed  unbounded  in  VKy  . 

The  “in  VKy ” is  deleted  when  clear  from  the  context.  Note  that  for  any  s eVKy, 
[x  e VKy  | .y  C x)  is  closed  unbounded,  and  so  if  X C VKy  is  stationary,  then  it 
is  unbounded.  David  Kueker  [72]  also  formulated  these  concepts  for  k — cl>\  for 
use  in  model  theory,  and  his  [77]  provides  a full  account  of  his  work  along  these 
lines. 

Jech’s  original  formulation  of  closed  in  VKy  was  different  and  Solovay  ob- 
served that  the  current  one  is  equivalent.  Because  of  its  usefulness,  that  first 
formulation  is  recovered  through  the  following  lemma.  A set  D is  c- directed  iff 
for  any  x,y  e D there  is  a z e fl  such  that  x U y c Solovay’s  observation 
turns  out  to  be  the  VKy  case  of  a result  about  transfinite  directed  sets: 

25.1  Lemma  (Cohn  [65:  33];  Isbell  [66]).  X is  closed  in  VKy  iff  for  any  C- directed 
D c X with  |D|  < k,  U D e X. 

Proof  The  substantive  direction  is  established  by  induction  on  |D|.  The  result 
is  immediate  if  |D|  is  finite.  Suppose  now  that  |Z)|  < at  is  infinite  and  for  each 
c-directed  E c X with  \E\  < \D\,  (j£  e X.  Enumerate  D as  {xa  \ a < |D|}. 
Recursively  define  Da  for  a < | D\  to  be  a C-directed  subset  of  D of  minimal 
cardinality  including  {v£,}U[J^<Q,  Dp.  By  a simple  closure  argument,  if  a is  finite, 
then  Da  is  finite,  and  otherwise  | Da\  = \ [Jp<a  Dp\  — |or|.  Set  ya  — [J  Da  for 
a < \D\.  Then  by  induction  each  ya  e X,  and  since  ya  c for  a < a < \D\, 
Uff<|D|  ya  = [J  D e X.  H 
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The  following  provides  analogues  to  most  of  0.1  and  is  established  in  like 
manner. 

25.2  Exercise  (Jech  [71]). 

(a)  If  r)  < k and  (Ca  \ a.  < if)  is  a sequence  of  sets  closed  unbounded  in 
VKy,  then  P)a<  Ca  is  closed  unbounded  in  VKy. 

(b)  If  (Cff  | a < y)  is  a sequence  of  sets  closed  unbounded  in  VKy,  then  its 
diagonal  intersection  Aa<yCa  — [x  e VKy  \ x e Haev  Ca}  's  dosed  unbounded 
in  VKy. 

(c)  If  S is  stationary  in  VKy  and  f is  a choice  function  of  S (i.e.  fix)  e x 

for  every  x e S — {0}),  then  there  is  a T c S stationary  in  VKy  such  that  f is 
constant  on  T . H 

Let  CKtY  denote  the  filter  generated  by  the  closed  unbounded  subsets  of  VKy: 

CK'Y  — {X  c VKy  | 3C(C  is  closed  unbounded  in  VKy  A C c I)]  . 

Then  by  25.2,  CK,y  is  a normal  filter  over  VKy,  called  the  closed  unbounded  filter 
over  VKy. 

Menas  established  a useful  basis  theorem  for  CKiY.  For  new  and  /:  [yf  — > 
VKy,  define 


C(f)  — {x  e VKy  | v is  infinite  A V.?  e [x]"(/(5)  c x)}  . 

Analogously  define  C(/)  for  /:  [y]<w  ->  VKy.  It  is  simple  to  see  that  these 
C(/)’s  are  closed  unbounded. 

25.3  Proposition  (Menas  [74]).  For  any  X C VKy,  X e CK,y  iff  there  is  an 
f:  [y]2  ->  VKy  such  that  C(f)  c X. 

Proof.  In  the  substantive  direction,  suppose  that  C is  closed  unbounded;  an 
/:  [y]2  ->  VKy  must  be  found  so  that  C(/)  c C.  To  this  end,  a g:  [y]<ro  ->  VKy 
is  first  found  so  that  C(g)  c C and  then  refined  later: 

Define  g(s)  for  5 e [y]<'u  by  recursion  on  |s|  as  follows:  Let  g(0)  be  any 
infinite  member  of  C,  and  if  |s|  =n+ 1,  let  g(s)  satisfy  .?U[g(r)  | t e [s]"}  c g(s) 
and  g(s)  e C.  Then  C(j)  c C:  If  x e C(g),  then  x is  infinite  and  IJfgCs)  I 5 e 
[x]<w]  c x,  and  so  equality  holds  since  s c g(s).  But  clearly,  [g(s)  | s e [x]<0J} 
is  c -directed,  and  so  by  25.1,  x e C. 

The  following  will  be  established  next: 

(*)  Whenever  1 < n < w and  h:  y"  ->  VKy, 

there  is  an  h:  [y]2  — > VKy  such  that  C(h)  c C(h)  . 

Once  done,  the  proof  can  be  completed  as  follows:  For  each  new  let  g„:  [y]2  -> 
VKy  be  such  that  C(g„)  c C(g\[y]n).  Define  /:  [y]2  VKy  by:  f{s)  — 
U„  Snis).  Then  C(f)  c fl,,  C(gn)  c C(g)  c X. 
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To  establish  (*),  it  will  be  convenient  to  show  that  whenever  n > 1 and 
h:  [y]"+1  ->  VKy,  there  is  an  h:  [y]"  — >■  VKy  such  that  C(h)  c C(h).  This 
suffices  by  finite  descent.  To  this  end,  let  p\  [y]2  y be  a bijection  such  that 
ma x(a,  /3)  < p(a,  f),  and  ttq,  jt\  \ y y be  such  that  p(jro(ot),  7ti(a))  — a for 
every  a < y.  Define  h:  [y]n  — >•  VKy  so  that  for  any  a \ < ...  < a„  < y, 
h(a\, , an)  e C(h)  and 

h(a l ocn)  3 {p(an_i,  a„)}  U . . . , a„_i,  n0(an),  nfia,,))  . 

Then  C(A)  c C(/j):  If  .r  e C(h ) and  ai  < ...  < a„+i  are  all  in  x,  then 
p{ an,  oin+\ ) e h(a i,  . . . , a„_2,  a„,  a„+i)  c x,  and  so 

h{a i, . . . , a,!+i)  C . . . , orB_i,  p(a„ , a„  +i))Cx.  H 

This  result  shows  in  particular  that  for  any  C closed  unbounded  in  VKy  there 
is  a C c C closed  unbounded  in  VKy  such  that  for  any  x,y  e C,  x C\y  e C.  If 
C is  closed  unbounded  in  VKy  so  that  for  any  x,  y e C,  x U y e C,  then  there  is 
an  /:  [y]1  ->  VKy  such  that  C(f)  c C:  For  a < X,  just  let  /({a})  e C satisfy 
a e /({a}).  However,  Menas  observed  that  for  y > k+ , the  [y]2  in  25.3  cannot 
be  replaced  by  [y]1.  In  particular,  for  such  y there  are  C closed  unbounded  in 
VKy  such  that  for  any  C c C closed  unbounded  in  VKy,  there  are  x,  y e C such 
that  xUy  C.  Donna  Carr  [82]  investigated  closed  unbounded  sets  closed  under 
unions;  she  also  observed  the  following,  almost  explicit  in  Menas  [74:  332],  which 
is  the  analogue  of  the  fact  that  the  closed  unbounded  filter  CK  is  included  in  every 
normal  filter  over  k. 

25.4  Proposition  (Carr  [82],  Menas  [74]).  If  F is  a normal  filter  over  VKy,  then 

i\.Y  L /•'. 

Proof.  Assume  that  this  fails;  then  by  the  previous  proposition  there  is  an 
/:  [y]2  — > VKy  such  that  X — VKy  — C(/)  is  F-stationary.  For  each  x e X, 
there  are  ax  < fix  both  in  x so  that  /( ax,  fix)  g x.  By  applying  normality  (22.6) 
twice,  there  are  /'’-stationary  KI  and  a < f such  that  for  every  x e Y,  ax  — a 
and  fix  — ft.  But  this  is  a contradiction,  since  Y is  unbounded  and  so  there  must 
be  a y eY  such  that  /(a,  fi)  c y.  H 

Jech  [73]  had  previously  observed  that  if  U is  a normal  ultrafilter  over  VKy, 
then  CK,y  c U. 

Pursuing  analogies,  the  next  problem  that  naturally  arises  is  to  generalize 
Solovay’s  result  16.9  that  every  set  stationary  in  k can  be  partitioned  into  k 
disjoint  stationary  sets.  Menas  [74:  337]  conjectured  for  X > k: 

Every  stationary  subset  of  VKX  can  be 

partitioned  into  X<K  disjoint  stationary  sets  . 

Most  of  the  results  have  been  about  this  assertion  with  X<K  replaced  by  X.  The 
first  observations  can  be  deduced  from  consideration  of  one  particular  stationary 
set. 
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25.5  Proposition.  Suppose  that  k < X and  S — {x  e VKX  \ \x  fl  a:  | = |x|}.  Then: 

(a)  S is  stationary  in  VKX. 

(b)  If  k is  a successor  cardinal,  then  S e CKtx- 

(c)  If  k < X and  k is  X-supercompact,  then  S CKtx- 

(d)  (Baumgartner)  If  X C S is  stationary,  then  X can  be  partitioned  into  X 
disjoint  stationary  subsets. 

Proof,  (a)  Suppose  that  C is  closed  unbounded  in  VKX.  Then  it  is  easy  to  find 
xn  eVKX  such  that  xn  c x„+\  for  new  with  X2,n  e S and  X2m+i  e C for  mem. 
It  follows  that  x«  e S n C. 

(b)  If  k = v+ , then  {x  e VKX  \ \x  fl/c | = v]  is  closed  unbounded  in  VKX,  and 
this  set  is  included  in  S. 

(c)  If  U is  a normal  ultrafilter  over  VKX,  then  k < X implies  by  22.1 1(a)  that 
{x  e VKX  | \x  fl  k\  < |x|]  e U.  But  CK;K  c U by  the  previous  proposition,  and  so 

5 i cK,k. 

(d)  For  each  x e X,  let  fx:  x — > x Ok  be  injective.  For  a < X,  define  ga  on 
the  stationary  set  {x  e X \ a e x}  by:  ga (x)  = /v(a).  Then  by  25.2(c)  there  is  a 
stationary  Xa  C.  X such  that  ga  is  constant  on  Xa,  say  with  value  < k. 

Now  for  any  v < X such  that  cf(u)  > k,  there  is  an  ?/  < k such  that 
|{or  < v | i]a  = t]}\  = v,  and  so  it  is  simple  to  see  that  {Xa  \ a < v A rja  = ?;}  is  a 
family  of  v disjoint  stationary  subsets  of  X.  Hence,  if  cf(A)  > k,  the  result  follows. 
If  on  the  other  hand  cf(A)  < k,  it  can  be  assumed  that  k < X,  else  the  result  follows 
from  Solovay’s  16.9  and  22.5(b).  So,  let  {pa  \ a < cf(A.)}  be  an  increasing 
sequence  of  regular  cardinals  cofinal  in  X with  k < po.  By  taking  v — k+  < X in 
the  above,  first  find  a partition  X — Ua<K+  Ya  into  disjoint  stationary  sets.  Then 
for  each  a < cf(A)  < k the  argument  with  X replaced  by  Ya  can  be  applied  to 
get  a partition  of  Ya  into  /x„  disjoint  stationary  sets.  Finally,  the  conglomeration 
of  these  partitions  is  a partition  of  X into  X disjoint  stationary  sets  as  desired. 

H 


25.6  Corollary.  Suppose  that  k < X.  Then: 

(a)  VKX  can  be  partitioned  into  X disjoint  stationaiy  sets. 

(b)  (Jech  [73]  for  regular  X;  Matsubara  [87])  If  k is  a successor  cardinal, 
then  every  stationary  subset  of  VKX  can  be  partitioned  into  X disjoint  stationary 
sets. 


Di  Prisco  (see  Di  Prisco-Marek  [82])  showed  that  for  regular  X > k,  every 
closed  unbounded  subset  of  VK X can  be  partitioned  into  X disjoint  stationary  sets. 
For  k weakly  inaccessible,  he  formulated  a concept,  super  stationary,  intermediate 
between  stationary  and  closed  unbounded,  and  showed  that  for  regular  X > k every 
superstationary  subset  of  VKX  can  be  partitioned  into  X disjoint  superstationary 
sets.  When  S ^ CKtx  as  in  25.5(c),  then  S is  an  example  of  a stationary  but  not 
superstationary  subset  of  VKX. 

In  1981,  Baumgartner  made  the  following  observation  about  the  set  S that 
reestablished  direct  connections  with  large  cardinals: 
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25.7  Theorem  (Baumgartner).  Assume  the  hypothesis  of  25.5.  If  0#  does  not  exist, 
then  S e CK,\>  and  so  eveiy  stationary  subset  of  VKX  can  be  partitioned  into  X 
disjoint  stationary  sets. 

Proof.  Assume  to  the  contrary  that  5 j CK:x-  Then  k < X.  Let  f:  X ->  Lx  be  a 
bijection  which  is  the  identity  on  k + 1.  Then 

C = [x  e VKX  \ x n k e k a k e x a f“x  -<  Lx} 

is  readily  seen  to  be  closed  unbounded.  By  assumption  VKX  — 5 is  stationary,  so 
there  is  a y e C such  that  \y  fl  k\  < |y|.  Let  La  be  the  transitive  collapse  of 
f“y  and  j the  inverse  of  the  collapsing  isomorphism,  so  that  j:  La  -<  Lx-  Setting 
8 = y IT  k,  since  / is  the  identity  on  k + 1 and  Key,  j(8 ) = k and  crit(j)  = 8. 
But  8 < |y|  = |a|,  and  so  by  21.1,  0#  exists.  H 

Hence,  the  refutation  of  Menas’s  conjecture  with  X<K  replaced  by  X neces- 
sitates the  existence  of  0#!  The  following  observation  made  in  passing  precludes 
the  possibility  of  developing  an  inner  model  theory  for  y -supercompactness  in  a 
naive  manner: 

25.8  Exercise.  Suppose  that  k < X and  U is  a normal  ultrafilter  over  VKX.  Then: 

(a)  VKX  ni  <£U. 

(b)  L[U]  = L. 

Hint.  For  (a),  assume  to  the  contrary  that  VKXC\ L e U , and  noting  that  VKX  — S e 
U as  for  the  proof  of  25.5(c),  use  the  proof  of  25.7  to  find  a y as  there  satisfying 
y e L to  derive  contradiction,  (b)  follows  from  (a).  H 

A contradiction  from  VKXC\  L e U can  also  be  derived  by  considering  the  0# 
Skolem  term  generation  of  members  of  VK  X fl  L and  stabilizing  components  with 
normality. 

With  a surprising  result  that  raised  the  level  of  sophistication  in  this  area, 
Gitik  in  1982  showed  relative  to  supercompactness  that  Menas’s  conjecture  can 
fail  in  a strong  sense: 

25.9  Theorem  (Gitik  [85]).  Suppose  that  k is  supercompact  and  X > k.  Then 
there  is  a p.o.  P that  preserves  cardinals  > k such  that  ||-/>  k is  inaccessible  A 35(5 
is  stationary  in  V tch  A S cannot  be  partitioned  into  k+  disjoint  stationary  sets). 

Gitik  observed  that  this  is  optimal  in  the  sense  that  for  any  X > k every 
stationary  subset  of  VKX  can  be  partitioned  into  k disjoint  stationary  sets,  using  a 
version  of  Solovay’s  argument  for  16.9.  He  also  noted  that  k can  also  be  made 
the  least  inaccessible  cardinal,  using  a construction  due  to  Woodin.  The  exact 
consistency  strength  of  the  proposition  that  there  is  a stationary  subset  of  Vkk+ 
that  cannot  be  partitioned  into  k+  disjoint  stationary  sets  is  unknown;  Gitik’s  proof 
established  as  an  upper  bound  the  existence  of  a k that  is  k +++  -supercompact. 
Recently,  Shioya  [03]  improved  the  upper  bound  to  the  existence  of  a k that  is 
k+ -supercompact  and  moreover  provided  a much  simpler  proof  of  25.9. 
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The  focal  set  S of  25.5  occurred  at  one  point  in  Gitik’s  argument.  Moreover, 
Baldwin  [83]  was  led  to  S in  the  course  of  generalizing  the  Mahlo  hierarchy  for 
applications  to  the  developing  inner  model  theory;  the  result  of  Jech  [86]  is  based 
on  S;  and  S serves  to  distinguish  between  versions  of  the  closed  unbounded  filter 
formulated  in  Matsubara  [88a].  Consider  the  possibility  S <£.  CKX- 

For  some  X,  {x  e V^X  \ \x  fl  k\  < |v|]  is  stationary  in  VKX  . 

As  a nice  example  of  how  questions  are  resolved  in  terms  of  large  cardinals, 
the  exact  consistency  strength  of  this  proposition  was  determined  in  the  linear 
hierarchy.  By  25.5(c)  and  25.7  the  consistency  strength  lies  between  the  existence 
of  0#  and  the  existence  of  a k that  is  k+ -supercompact.  Baldwin  [84]  localized 
it  to  the  lower  end  by  showing  that  it  suffices  to  take  X satisfying  a partition 
property  weaker  than  Ramsey  with  respect  to  a weakly  inaccessible  re  < X.  Then 
Donder-Koepke-Levinski  [88]  pinned  down  the  exact  partition  property  needed 
and  confirmed  its  necessity  using  the  newly  available  inner  model  theory  to  good 
effect.  As  was  already  evident  from  Levinski  [84: 229]  however,  for  k > or 3 
the  proposition  with  the  additional  requirement  that  X = k+  implies  at  least  the 
existence  of  0 ' , and  the  exact  consistency  strength  is  unknown. 

Menas’s  original  conjecture,  partitioning  into  X<K  sets,  was  refuted  in  ZFC 
with  k — a>  1 : 


25.10  Theorem  (Baumgartner-Taylor  [82]).  Assume  GOT  and  v = XH°.  Then  for 
any  coi-c.c.  p.o.  P,  |[/>  3 S(S  is  stationary  in  V„H X A S cannot  be  partitioned  into 
v++  disjoint  stationary  sets).  In  particular,  Con(ZFC)  implies  Con(ZFC  + 2K°  is 
large  + 3S(,S  is  stationary  in  Vmco2  A S cannot  be  partitioned  into  (04  disjoint 
stationary  sets)). 

On  the  other  hand,  Matsubara  established  the  consistency  of  Menas’s  conjec- 
ture as  a corollary  to  the  following  case: 

25.11  Proposition  (Matsubara  [90]).  Suppose  that  k<x  < X<K  — 2X.  Then  every 
stationary  subset  of  VKX  can  be  partitioned  into  X<K  disjoint  stationary  sets. 

Proof.  Note  first  that  if  Y is  unbounded  in  VKX,  then  VKX  C |J veYV(y)  so  that 
X<K  < k<k  ■ |T|,  and  k<k  < X<K  thus  implies  that  |T|  = 2X.  Next,  as  X<K  — 2X, 
let  (/„  | a < 2X)  enumerate  the  collection  of  functions:  [l]2  ->  VKX  so  that 
each  function  appears  cofinally  often.  In  terms  of  25.3,  each  C(fa ) is  closed 
unbounded,  and  for  any  Z e CK,x  and  £ < 2X,  there  is  an  a such  that  £ < a < 2X 
and  C(fa)  c Z. 

Suppose  now  that  X is  stationary  in  VKX.  For  each  a < 2X  choose  a sequence 
(??“  ||  < a)  of  distinct  members  of  X by  recursion  on  a as  follows:  Flaving 
chosen  such  sequences  for  < a,  first  note  that  X fl  C(fa)  is  stationary,  hence 
unbounded,  and  therefore  has  cardinality  2X  as  noted  above.  Let  (rj | | f < a)  be 
a sequence  of  distinct  members  drawn  from  (lnC(/„))-[i|  | jl  < a A|  < /3}. 
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For  | < 2X,  set  = {??"  | £ < a < 2X}.  Then  {Z^  | | < 2X]  is  a collection 
of  pairwise  disjoint  subsets  of  X.  The  proof  is  completed  by  showing  that  each 
Xj:  is  stationary:  If  C is  closed  unbounded,  then  as  noted  before  there  is  an  a 
such  that  § < a < 2X  and  C(fa)  c C.  But  then,  17“  e Z^  fi  C(fa)  c Z^  fi  C.  ~\ 

25.12  Corollary.  Assume  GCH  and  k < X.  Then: 

(a)  If  0#  does  not  exist,  then  every  stationary  subset  of  VKX  can  be  partitioned 
into  X<K  disjoint  stationary  sets. 

(b)  VKX  can  be  partitioned  into  X<K  disjoint  stationary  sets. 

Proof.  If  X<K  = X,  the  results  follow  from  25.6(a)  and  25.7,  and  if  X<K  > X,  from 
25.11.  H 

(b)  also  follows  from  the  work  on  combinatorial  principles  in  Donder-Matet 
[93],  which  showed  that  if2<K  < X,  then  VKX  can  be  partitioned  into  X<K  stationary 
sets,  improving  a result  of  Matsubara  [88].  Both  papers  include  results  showing 
that  some  such  hypothesis  is  necessary. 

A major  development  from  Jech  [73]  was  the  investigation  of  partition  prop- 
erties for  VKy.  Note  that  for  \y  \ > k every  unbounded  subset  of  VKy  has  members 
x\  c yi  and  X2  <£  yi-  Hence,  for  11  e cu  it  is  natural  to  consider  partitions  of 

YPkYTc  — {{*1 — > *«}  I x\  c . . . C x„  e VKy}  . 

For  functions  / on  such  sets,  f(x  1, . . . , xn)  is  written  for  f({x  1, . . . , x„})  with 
the  understanding  that  X\  C ...  C xn.  For  such  an  /, 

H ^VkY  is  homogeneous  for  f iff  there  is  an  i such  that  xi  C . . . C xn 

all  in  H implies  that  f(x  1, . . . , x„  ) = i . 

The  following  are  generalizations  (from  n = 2)  of  propositions  formulated  by 
Jech. 

Part(/r,  y)n  iff  for  any  /:  \VKy~\nc  2 there  is  an 

unbounded  H c.VKy  homogeneous  for  / . 

Part*(/r,  y)n  iff  for  any  /:  [VKy]nc  -»  2 there  is  a 

stationary  H C.'PKy  homogeneous  for  / . 

Jech  observed  that  Part(Ar,  y)2  implies  that  k is  weakly  compact,  and  raised  several 
questions. 

The  early  work  along  these  lines  was  done  around  1971  by  Solovay  and 
Menas,  who  considered  the  generalization  of  Rowbottom’s  7.17  to  normal  ultra- 
filters over  VKy.  For  U such  an  ultrafilter, 

U has  the  partition  property  iff  for  any  /:  [VKy]2c  — > 2 there  is 

a set  in  U homogeneous  for  / . 

This  property  implies  Part* (at,  y)2  by  25.4.  Except  for  the  case  y — k handled  by 
7.17  and  22.5(b),  that  every  normal  ultrafilter  over  VKy  has  the  partition  property 
was  not  evident.  Menas  found  a simple  characterization: 
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25.13  Theorem  (Menas  [76]).  For  U a normal  ultrafilter  over  VKy,  U has  the 
partition  property  iff  there  is  an  A e U such  that  x C y both  in  A implies  that 

\x\  < \y  H/c|. 

Proof.  In  the  forward  direction,  define  /:  [VKy]c  2 by:  fix,  y)  — 0 iff 
\x\  < |y  nx|.  Let  H e U be  homogeneous  for  /;  since  there  are  xq,  yo  e H such 
that  |*o | < | >’o  n /c  |,  the  desired  result  follows. 

The  converse  is  established  by  adapting  the  argument  for  Rowbottom’s  7.17: 
Let  A be  as  hypothesized,  and  suppose  that  /:  [' PKy ]£.  ->  2.  For  each  * e VKy 
define  fx:  [y  e VKy  \ x c y}  — > 2 by  /v(y)  = fix,  y).  Then  for  each  * e VKy 
there  is  an  Hx  e U and  an  ix  < 2 such  that  fx“Hx  = {if.  It  follows  that  there 
is  a B c A with  B e U and  an  i <2  such  that  for  any  x e B,  ix  = i . Set 
H - {x  e B \ x e f|{^  I y C x A y e S}).  Then  y C x both  in  H implies 
that  fix,  y)  = i,  so  it  suffices  to  show  that  H e U: 

Assume  that  this  fails,  so  that  B — H e U.  Then  for  any  * in  this  set,  there  is 
an  hix ) e B so  that  h{x)  c x and  x H/,(x >.  Let  j:  V < M = Ult(V,  U).  Then 
Ul\u  C y“L,  and  as  B c A so  that  |/i(x)|  < |x  n*|  for  x e B — H,  \[h]v\  < k by 
22.11(a).  Hence,  a simple  argument  using  a: -completeness  shows  that  there  is  an 
s e VKy  such  that  [h]v  = j{s).  But  for  any  x e B — H satisfying  hix)  — s we 
have  x <f  Hs,  and  this  contradicts  Hs  e U.  ~\ 

Proceeding  inductively  as  for  7.17,  this  argument  shows  that  U has  the  par- 
tition property  iff  for  any  n > 2,  whenever  /:  [VKy]f  -»  2,  there  is  a set  in  U 
homogeneous  for  /. 

Menas  moreover  found  a useful  property  that  implies  the  partition  property. 
For  U a normal  ultrafilter  over  VKy, 

XiU)  ff  for  some  / e kk,  [x  e VKy  \ f{\x  D at|)  = |x|)  e U . 

With  jij:  V ■<  M = Ult(F,  U),  /(!/)  translates  by  22.11(a)  into  the  assertion  that 
for  some  / e kk,  juif ){k)  = \y\.  That  /({/)  implies  that  U has  the  partition 
property  depends  on  the  following  lemma,  a derivative  of  Kunen’s  proof  of  his 
23.12: 

25.14  Lemma  (Solovay  [74:  371]).  For  U a normal  ultrafilter  over  VKy,  there  is 
an  E e U such  that  x C y both  in  E implies  that  |x|  < |y|. 

Proof.  Let  / be  w-Jonsson  for  y,  and  set 

E = [x  e VKy  \ /|[x]®  is  cu-Jonsson  for  x]  . 

Then  for  x c y both  in  E,  |x|  = |y|  implies  that  x = /“[x]®  = y.  Hence,  it 
suffices  to  show  that  E e U\ 

Let  j:  V ^ M = Ult(  V.  U).  Then  by  Los’s  Theorem,  it  suffices  to  show 
that  jif)\[j“y]m  is  «-Jonsson  for  j“y,  noting  that  we  do  not  need  to  distinguish 
between  V and  M as  rM  c M.  We  indeed  have  jif)“[j“y]co  c j“y  (as  in 
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Kunen’s  original  proof  of  his  23.12).  Suppose  now  that  y C j“y  with  |y|  = 
|j“y|  = |y|.  Setting  z — f“[z]w  = y since  / is  &>-Jonsson  for  y.  It 

follows  that  for  any  a < y,  there  is  an  s e [z]a  satisfying  a — f(s ),  so  that 
7(a)  = where  j(s)  e [y]®.  Hence  j(fY‘[y]w  = j“y.  H 

25.15  Proposition  (Menas  [76]).  For  U a normal  ultrafilter  over  VKy,  x(U) 
implies  that  U has  the  partition  property. 

Proof.  Let  / e kk  be  such  that  X = [x  e VKy  | /( |x  fl  k|)  = |x|]  e U.  With 

j:  V < M = Ult(Y,  U),  clearly  — f“tc  c k as  crit(y')  = k.  It  follows 

by  22.11(a)  that  Y — {x  e VKy  \ /“ \x  fl  k\  c |x  fl  at | } e JJ . Finally,  by  25.14 
let  E e U be  such  that  x c y both  in  E implies  that  |x|  < |y|.  Now  for  x c y 

both  in  X n Y fl  E e U,  since  |x|  < |y|,  |x  fl  k\  < |y  fl  k\  by  definition  of  X,  and 

moreover  /(|xfl«-|)  = |x|  < |yfl«-|  by  definition  of  Y.  Consequently,  U has  the 
partition  property  by  25.13.  H 

It  follows  that  for  y easily  definable  from  k,  like  y — k+  or  y — 2K,  every 
normal  ultrafilter  over  VKy  has  the  partition  property.  The  initial  observations 
about  the  partition  property  were  along  these  lines,  made  by  Solovay  assuming 
GCH.  If  y = k+  for  example,  then  the  function  / e K k given  by  /(a)  = a+ 
verifies  x(U)  for  any  normal  ultrafilter  over  Vkk+.  Donald  Pelletier  (see  Kunen- 
Pelletier  [83])  later  established  that  x(U)  is  strictly  stronger  than  the  partition 
property  for  U. 

Menas  used  the  property  x to  show  that  if  k is  supercompact,  then  for  every 
y > k there  is  a normal  ultrafilter  over  VKy,  in  fact  the  maximal  possible  number 
of  such  ultrafilters,  with  the  partition  property.  The  argument  is  as  for  22.14,  with 
a further  twist  loaded  into  Solovay’s  for  22.13: 

25.16  Exercise  (Menas  [76]).  Suppose  that  k is  2^*“  -supercompact.  Then  for  any 
A e V(V(VKy))  there  is  a normal  ultrafilter  W over  VKy  with  /(VF)  such  that 
A e Mw.  Hence,  there  are  22  ’1  normal  ultrafilters  over  V^y  with  the  partition 
property. 

Hint.  Assume  that  this  fails  for  some  y > K,  and  let  yo  be  the  least  such.  Following 
the  proof  of  22.13,  let  U be  a normal  ultrafilter  over  VK{ 2'yo'  ) with  corresponding 
j:  V -<  M = Ult(F,  U),  and  let  <p(vo,  v\,  V2)  be  such  that  <p[A , k , y]  iff  A e 
V(V(VKy))  and  for  any  normal  ultrafilter  W over  VKy  with  /(IF),  A £ Mw. 
It  follows  as  before  that  if  j0 : V -<  M0  = Ult(F,  U\yf),  then  by  the  choice  of 
yo,  Mo  |=  yo  is  the  least  ordinal  such  that  3uoip[K,  yo]-  A contradiction  results, 
once  it  is  verified  that  x (U  \ yo)-  But  if  / e kk  is  defined  by  /(a)  = \fi\  for  the  least 
ft  such  that  3vofi[a,  ft]  if  one  exists,  and  = 0 otherwise,  then  ju\y0(f)(K)  — I yol - 

H 


What  then  heightened  interest  in  the  partition  property  was  Solovay’s  discov- 
ery that  it  does  not  always  hold: 
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25.17  Proposition  (Solovay  - Menas  [76]).  Suppose  that  k < X < 8 and  both 
k and  X are  8-supercompact.  Then  there  is  a normal  ultrafilter  VKS  without  the 
partition  property. 

Proof.  Let  D be  a normal  ultrafilter  over  VK8,  and  for  each  x e V\8  with  x I)  k 
set 

D\x  = {{sfi.x  | s e Z]  | Z e D}  . 

Then  D\x  is  a normal  ultrafilter  over  VKx  (cf.  after  22.12).  Now  let  U be  a normal 
ultrafilter  over  Vfi,  and  define  W by: 

Xeff  iff  X c Vk8  a {x  e Vk8  \ x 2 k a XDVkx  e D\x]  e U . 

Then  W is  a normal  ultrafilter  over  VK8  (cf.  the  hint  for  22.9),  one  that  glues 
together  approximations  to  D using  U . D may  or  may  not  have  the  partition 
property,  but  W does  not: 

To  show  this,  it  suffices  by  25.13  to  take  an  arbitrary  X eW  and  find  .r  c y 
both  in  X such  that  |x|  > |y  D k\.  For  such  an  X,  set 

Y = {x  e Vk8  | x 2 k a ID  VKx  e D|x]  , 

and  for  each  x e Y set 


Zx  — [i1  g 7 ^k8  | s Pi  x G X } . 

Then  by  definition  of  W,  Y e U and  Zx  e D for  every  x eY . Now  let  x c y be 
both  in  Y,  and  by  unboundedness  let  s e Zj  fl  Zj  be  such  that  s fl  (y  — x)  0. 
Set  .r  = x fl  .5  and  y = y fl  s.  Then  x C y and  both  are  in  Z,yetasxny2/r 

\x\  = \x  n s|  > \k  n s\  = \y  n k n s|  = |y  n k\  . -\ 

Taking  8 = X,  if  k < X,  k is  A.-supercompact,  and  X is  measurable,  then  there 
is  a normal  ultrafilter  over  VKX  without  the  partition  property.  Just  after  Solovay 
established  this  result,  Kunen  (see  Kunen-Pelletier  [83])  improved  it  by  showing 
that  if  k < X,  a:  is  k-supercompact,  and  X is  ineffable  (a  property  consistent  with 
V — L discussed  in  volume  II),  then  there  is  a y < X and  a normal  ultrafilter  over 
VK  y without  the  partition  property.  Moreover,  he  showed  that  if  k is  supercompact 
and  there  is  a y > k so  that  some  normal  ultrafilter  over  VKy  does  not  have  the 
partition  property,  then  y is  Tl^-indescribable.  Hence,  it  is  consistent  that  there  is 
no  such  y. 

Further  results  about  the  partition  property  for  normal  ultrafilters  were  es- 
tablished in  Menas  [76],  Di  Prisco  [77],  Pelletier  [81],  Kunen-Pelletier  [83],  and 
Barbanel  [86,92,93,93a], 

Turning  to  the  more  general  propositions  Part  (at,  y)"  and  Part*(/c,  y)",  by 
Menas’s  25.16,  if  k is  2lyl<‘-SUpercompact)  then  Part* (at,  y)2  (and  in  fact 
Part*(/r,  y)n  for  every  n e to  by  the  comment  after  25.13).  Citing  this  for  one 
direction,  Magidor  established  the  following  combinatorial  characterization  of  su- 
percompactness: 
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25.18  Theorem  (Magidor  [74]).  k supercompact  iff  Part*(K,  y)2  for  every  y > k. 

It  was  recently  observed  that  the  forward  direction  has  a sharp  local  form, 
improving  what  can  be  deducted  from  25.16.  Shizuo  Kamo  [97,  02]  established:  If 
k is  y -supercompact,  then  there  is  a normal  ultrafilter  over  VKy  with  the  partition 
property  (and  hence  Part*(K,  y)2  holds).  See  Shioya  [oo]  for  a short  proof.  There 
is  a quick  argument  for  showing  that  if  k is  y -supercompact,  then  Part* (at,  y)2 
holds: 

Assume  to  the  contrary  that  the  proposition  fails  for  some  y > k,  and  let 
yo  be  the  least  such.  Let  U be  a normal  ultrafilter  over  VkYo  with  corresponding 
j:  V < M = Ult(V,  U).  Define  f:  k ->  k by:  /(£)  = 0 unless  § is  a regular 
uncountable  cardinal  and  Part*(f , y ) fails  for  some  y > £ , in  which  case  f(f ) = 
the  least  such  y.  Since  M is  closed  under  \yo \ <K -sequences  (22.11(b)),  it  follows 
that  j(f)(K)  — yo . But  then,  /((/),  so  that  by  25.15,  Part* (at,  y),  which  is  a 
contradiction. 

Magidor  also  established  an  intermediate  characterization  of  supercompact- 
ness in  terms  of  a concept  formulated  in  Jech  [73],  y-ineffability.  Di  Prisco  and 
William  Zwicker  in  their  [80]  refined  this  result,  and  moreover  provided  an  anal- 
ogous characterization  of  strong  compactness.  Through  such  means  Baumgartner, 
Carr  [87],  and  Di  Prisco  independently  showed  that  if  Part(Ar,  y f holds  for  eveiy 
y > k,  then  k is  strongly  compact.  The  exponent  here  was  recently  lowered  to  2, 
but  it  remains  unknown  whether  strong  compactness  yields  even  exponent  2: 

25.19  Question.  If  k is  strongly  compact,  does  Part(Ar,  y)3  hold  for  eveiy  y > k‘! 

25.20  Theorem  (Matet).  If  Part(Ar,  y)2  holds  for  eveiy  y > k,  then  k is  strongly 
compact. 

The  property  Part(Af,  y)2  generalizes  the  characterization  k — >•  (at)|  of  the 
weak  compactness  of  k.  Carr  [85]  considered  generalizations  of  other  character- 
izations, and  in  [86, 87, 87a]  pursued  their  study  in  considerable  detail.  (See  also 
Johnson  [88,88a].) 

Recent  work  has  been  directed  toward  shortcomings  in  the  analogical  treat- 
ment of  (PKy,  C)  in  terms  of  (at,  <).  Zwicker  [84]  defined  a stationary  coding 
set  for  VKy  to  be  a set  A stationary  in  VKy  together  with  an  injective  function 
c:  A ->  y such  that  x C y both  in  A implies  that  c(x)  e y.  He  showed  that 
restricting  the  study  of  (' PKy,  C)  to  such  a set  A results  in  more  features  anal- 
ogous to  (k,  <).  Shelah  [86:  §§7,9]  then  established  the  existence  of  stationary 
coding  sets  in  a variety  of  situations,  e.g.  when  k > a>2  is  a successor  cardinal 
and  y — k+ . Zwicker  [86]  contained  related  work,  and  his  [89]  subsequently 
used  stationary  coding  sets  to  begin  a study  of  structural  properties  of  ideals  over 
VKy  in  the  spirit  of  the  extensive  treatment  in  Baumgartner-Taylor- Wagon  [82] 
for  ideals  over  k. 

Carr-Pelletier  [89]  also  began  such  a study,  but  by  proceeding  analogously 
with  (' PKy,  <)  where  < on  VKy  is  defined  by: 


25.  Combinatorics  of  VKy 


351 


x < y iff  x c y A \x  \ < |y  fl  k \ . 

This  new  order  is  suggested  by  25.13,  and  was  first  used  systematically  in  Can- 
ts?]. The  following  exercise  provides  versions  of  previous  results  in  terms  of  <. 
If  (X a | a eVKy)  e v‘yViVKy),  then  its  <-dicigonal  intersection  is  [x  e VKy  I 

* e H a<x  For  a filter  F over  VkY* 

F is  strongly  normal  iff  F is  fine,  and  for  any  (Xa  \ a e VKy)  e V‘YF, 
its  < -diagonal  intersection  is  in  F. 

For  X C VKy,  a function  /:  X — > VKy  is  <-regressive  on  X iff  fix)  < x for 
every  x e X.  For  n e a>,  set 

[PicYTc  — {{xi x„ } | xi  < ...  <x„  all  in  VKy}  . 

For  functions  on  this  set,  that  a subset  is  homogeneous  has  the  expected  meaning 
from  the  context. 

25.21  Exercise  (Carr-Pelletier  [89]). 

(a)  A strongly  normal  filter  over  VKy  is  normal. 

(b)  A fine  filter  F over  VKy  is  strongly  normal  iff  whenever  X is  F -stationary 
and  f is  <-regressive  on  X,  there  is  an  F -stationary  Y C X such  that  f is 
constant  on  Y. 

(c)  A normal  ultrafilter  over  VKy  is  strongly  normal. 

(d)  If  U is  a normal  ultrafilter  over  VKy  and  /:  [VK  y]"  — > 2 for  some  n e &>, 
then  there  is  a set  in  U homogeneous  for  f . 

Hint.  For  (c)  and  (d)  recall  the  argument  for  25.13.  H 

Thus,  the  partition  property  for  [VKy]\,  unlike  for  [VKy]^-,  holds  for  all 
normal  ultrafilters.  On  the  other  hand,  the  approach  with  < is  less  general,  for 
Carr-Levinski-Pelletier  [90]  showed  that  for  some  (every)  y > k there  is  a strongly 
normal  filter  over  VKy  iff  k is  Mahlo  or  k — v+  where  v<v  = v. 

The  investigation  of  VK  y is  far  from  exhausted,  and  both  the  stationary  cod- 
ing set  and  < ordering  approaches  to  structural  analysis  hold  promise.  Roughly 
speaking,  the  first  approach  seems  more  appropriate  when  either  y is  small  with 
respect  to  k or  the  property  being  investigated  is  invariant  under  isomorphisms 
induced  by  mappings:  VKy  ->  VKy,  and  the  second,  when  k is  large,  and  either  y 
is  large  with  respect  to  k or  the  property  being  investigated  is  not  invariant  under 
isomorphisms.  For  ideals  over  VKy  a full  structure  theory  has  yet  to  be  worked 
out. 
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26.  Extenders 

As  a nod  to  modernity,  this  section  reaches  ahead  to  provide  a basic  analysis 
of  elementary  embeddings  and  related  hypotheses  that  was  to  become  crucial 
to  both  the  development  of  inner  model  theory  and  the  consistency  analysis  of 
strong  propositions  from  the  late  1970’s  onwards.  (Volume  II  proceeds  with  the 
orderly  elaboration  of  strong  hypotheses,  especially  with  regard  to  combinatorics 
and  relative  consistency,  carried  out  systematically  through  the  1970’s  and  after.) 

As  described  in  §20,  if  U is  a normal  ultrafilter  over  k,  then  L[U\  is  an  inner 
model  of  ZFC  in  which  k is  measurable.  Moreover,  these  models  have  strong 
canonicity  properties  and  establish  the  relative  consistency  of  minimal  principles 
like  GCH,  and  their  structural  coherence  provide  a persuasive  argument  for  the 
consistency  of  measurability.  A first  attempt  at  relativizing  strong  hypotheses 
to  develop  a comparable  inner  model  theory  would  be  to  analogously  take  a 
normal  ultrafilter  U over  VKy  and  consider  L[U\.  In  the  case  of  measurability, 
the  absoluteness  k fl  L = k of  the  underlying  set  for  measurability  is  implicitly 
exploited  in  the  results  on  canonicity.  However,  in  the  presence  of  a normal 
ultrafilter  U over  VKy  with  \y\  > K,VKyC\L  — VKy  fails,  and  indeed  VKyC\L  g U 
so  that  L[U ] — L (25.8). 

The  absoluteness  problem  can  be  dealt  with,  at  the  cost  of  weakening  the 
hypotheses  to  be  relativized,  by  considering  extenders,  sequences  of  ultrafilters 
over  absolute  sets  whose  corresponding  ultrapowers  form  a directed  system.  Ex- 
tenders had  occurred  in  a general  context  in  Powell  [74].  However,  they  did 
not  enter  large  cardinal  theory  until  after  Mitchell  [79]  discovered  a way  to  de- 
velop a canonical  inner  model  theory  for  weak  versions  of  y -supercompactness. 
Dodd  and  Jensen  (see  Dodd  [82])  then  arrived  at  the  extender  concept  as  an 
improvement  over  Mitchell’s  formulations,  and  first  used  the  term.  The  basic 
theory  of  extenders  and  related  hypotheses  is  developed  here,  these  scant  remarks 
alone  suggesting  their  crucial  role  in  the  emerging  inner  model  theory.  Unlike 
for  y -supercompactness,  it  is  the  method  of  analysis  rather  than  those  hypotheses 
characterizable  by  it  that  is  described  first,  as  it  provides  the  main  motivation. 

The  basic  discovery  is  that  any  elementary  embedding  between  inner  mod- 
els can  be  approximated  arbitrarily  closely  as  direct  limits  of  ultrapowers  with 
constructive  features  reminiscent  of  iterated  ultrapowers.  Proceeding  in  a general 
setting  that  anticipates  wide  applicability,  suppose  that 

N and  M are  inner  models  of  ZFC  and  j:  /V  -<  M . 

There  is  no  presumption  here  that  j is  a (definable)  class  of  N or  that  M C N. 
By  5.1(b),  j has  a critical  point. 

Let  f > k — crit(y'),  and  £ > k the  least  ordinal  satisfying  ft  < j(f)  . 

The  basic  case  is  f = k,  i.e.  f < j(K );  the  general  context  is  for  the  study  of 
very  strong  hypotheses.  For  each  a e define  Ea  by: 

XeEa  iff  Xe  nN  a aej(X). 
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This  is  yet  another  version  of  the  now  familiar  idea  of  generating  ultrafilters  from 
embeddings.  If  X e V([t; ]|a|)niV,  then  j(X)  e P([;(f)]|a|)nM,  so  that  querying 
a e j (X)  is  apropos.  Although  Ea  may  not  be  in  N,  (A,  e,  Ea)  \=  Ea  is  a k- 
complete  ultrafilter  over  [£]|a|.  Strictly  speaking,  Ea  is  a principal  ultrafilter  for 
a e [/c]<®,  but  for  notational  simplicity  conventions  and  concepts  are  hereby 
liberalized  to  allow  such  Ea  throughout  this  section.  Stipulate  that 

£ — (Ea  | a e [/S]<w)  is  the  (k,  f3)-extender  derived  from  j . 

For  a e [/3]<<u  let 

ja:  N < Ult(A,  Ea) 

be  the  usual  ultrapower  embedding.  It  is  simple  to  check  that  ka:  Ult(7V,  Ea)  ->  M 
defined  by 

*«((/)£.)  = 7 (/)(<*) 

(where  ( f)°E  e Ult(/V,  Ea)  is  the  equivalence  class  of  / e fi  N ) is  elemen- 

tary, and  that  ka  o ja  = j.  In  particular,  Ult(A,  Ea)  is  well-founded  and  so  has  a 
transitive  collapse  Ma. 

Ult(A,  Ea ) is  identified  with  Ma  . 

Next,  for  a c b both  in  [/6]<®,  say  b = {a\, ...  ,an]  with  the  usual  under- 
standing that  a\  < ...  <an  and  a = {a,-, , . . . , aim } where  1 < i\  < . . . < im  < n, 
define  7Tba:  [f]"  [C ]'”  by: 


Then  it  is  simple  to  check  that  iab : Ma  — >■  Mb  defined  by 

([./"] £„ ) — [/"  ° J^ba^Eh 

is  elementary,  and  that 

iab  ° ja  = jb  and  kb  o iab  — ka  . 

Finally,  {( Ma  \ a e [/S]<'u),  (iab  \ a c b))  is  seen  to  be  a directed  system,  so 
stipulate  that 

{Me,  eg)  is  the  direct  limit , 
and 

je-  (N,  e)  -<  {Ms,e£)  , 
kas-  {Ma,  e)  < {Ms,  e£)  , and 
k£ : {Me,  e£ ) -<  (M,  e) 

the  corresponding  embeddings,  so  that  for  a e [yS]<<u, 

k£  ° je  = j , ka£  o ja  = j£  , and  k£  o ka£  = ka  . 

In  particular,  {M£,  e£)  is  well-founded,  and  so  assume  that 

M£  is  transitive  and  e£=  e D (Ms  x Ms)  ■ 
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The  crucial  features  of  the  extender  analysis  of  elementary  embeddings  are 
given  by  the  following  lemma.  Its  (a)  provides  a useful  representation  of  Mg 
(cf.  19.6),  and  its  (b)  and  (c)  describe  to  what  extent  Mg  approximates  M.  Al- 
though there  are  at  most  22'cl  distinct  Ma’s,  it  is  how  they  are  interwoven  by  the 
iab  s that  leads  to  these  considerable  properties. 

26.1  Lemma. 

(a)  Mg  = {jg(fHa)  | a e [P]<a  a / e [^'N  IT  N}. 

(b)  For  any  y satisfying  \VY\M  < P-'  V™  C ran(£f),  V“£  = V^f , and 
kg(x)  — x for  x e V™£ . 

(c)  crit (kg)  > P,  and  so  crit (jg)  — k and  p < jg(^).  If  P — j(f  ),  then 
crit (kg)  > p,  and  so  P — jg(t;). 

Proof,  (a)  If  x e Mg,  there  is  an  a e [/S]<w  and  an  f e IT  N such  that 
x = ka£([f]Ea),  and  so 

kg(x)  = kg(ka£([f]Ea))  = ka([f]Ea)  = j(f)(a)  . 


Consequently, 

ran  (kg)  = {;(/)(«)  I « e [p]<a>  A / e [f]WlV  n N]  . 

The  result  now  follows  since  kg  o jg  — j. 

(b)  Let  g e n N be  such  that  whenever  < f,  g|[|  is  a 

bijection  between  [IVd^]1  and  Vf . Then  for  any  y satisfying  \VY\M  < P and 
any  x € V^f , there  is  a £ < P such  that  j(g)({^})  = x.  It  follows  from  the 
characterization  of  xzn(kg)  above  that  V ^ c ran(^),  and  the  rest  follows  since 
the  inverse  of  kg  is  just  the  collapsing  isomorphism:  ran  (kg)  -*  Mg. 

(c)  This  is  similar,  since  it  is  readily  seen  that  p C ran(A;,s).  Note  that  if 

P = j(X)  = kg(jg(X)),  then  in  fact  P + 1 c ran  (kg).  H 

Having  formulated  extenders  in  terms  of  an  ambient  elementary  embedding, 
the  essential  features  are  abstracted  in  a definition.  For  N an  inner  model  of  ZFC, 
k a cardinal  in  the  sense  of  N,  p > k,  and  £ — (Ea  \ a e [P~\<m),  £ is  an 
N-(k,  P)-extender  (and  simply  a (at,  P)-extender  if  N — V)  iff  for  some  f > k: 

(i)  For  each  a e [/8]<<u,  ( N , e,  Ea)  \=  Ea  is  a k -complete  ultrafilter 
over  [£]|fl|,  and: 

(1)  For  at  least  one  such  a,  ( N , e,  Ea)  |=  Ea  is  not  /r+-complete. 

(2)  For  each  § e f , there  is  such  an  a so  that 
{y  e m|fl|  | % e s}  e Ea. 

(ii)  (Coherence)  Suppose  that  a c b are  both  in  [P]<w  and 

Ttba-  [Cl1*1  — ► is  defined  as  before,  i.e.  \f  b = {a\, . . . , an}  and 

a = {ah,. . . , aim},  then  7rfofl({ft ft,})  = {ft, , . . . , ft,,,}.  Then 


X e Ea  iff  {s  | nba{s)  e X]  e Eh  . 
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(iii)  (Well-foundedness)  Whenever  am  e [/6]<<a  and  Xm  e Eclm  for  m e 
a>,  there  is  an  d:  [Jm  a,„  — > f such  that  d“am  e X,„  for  every  such 
m. 

(iv)  (Normality)  Whenever  a e [/S]<tt>,  / e n N,  and 

{s  e [C]|fl|  | f(s)  e ma x(j)}  e Ea  , 
there  is  a be  [/S]<®  with  a c b such  that,  with  m,a  as  in  (ii), 

{«  e [?f  I f(nba(s))  es)e  Eb  . 

Again,  f = k is  the  basic  case,  and  the  general  context  is  for  the  study  of  very 
strong  hypotheses.  For  (i),  the  Ea  ’s  are  allowed  to  be  principal.  (i)(l)  establishes 
the  role  of  k,  and  as  will  become  clear,  (i)(2)  specifies  the  f . It  is  simple  to  check 
that  for  any  a < P,  if  the  a ’s  from  [o']<'w  satisfy  (i)(l)  and  (i)(2),  then  £\ [a]<0)  is 
an  N-(k , a)-extender. 

We  now  proceed  to  use  these  clauses  to  construct  an  M£  and  a jg\  N < Mg 
as  before;  throughout,  7tba  is  as  in  (ii).  First,  for  each  a e [/S]<<u  we  can  by  (i)  form 
the  ultrapower  Ult(7V,  Ea).  By  (iii)  applied  with  each  am  — a,  Ea  is  countably 
complete  (as  defined  before  19.11,  i.e.  for  any  {Xn  \ n e a>}  c Ea,  <f\n  Xn  f 0). 
It  follows  that  Ult(A,  Ea)  is  well-founded  and  so  has  a transitive  collapse  Ma. 
Let 

ja : JV  <Ma=  Ult(A,  Ea)  . 

Next,  for  a c b both  in  [yS]<<u,  with  (ii),  iah\  Ma  -<  Mh  can  be  defined  as 
before  by 

iab([.f]Ea)  = \_f  ° Jtba^Eb  ■ 

so  that 

iab  ° ja  ~ jb  • 

Again,  ((Ma  \ a e [/8]<®),  ( iab  I a c b))  is  a directed  system,  so  stipulate  that 

{Mg,  eg)  is  the  direct  limit, 
and 

jg:  (N,  e)  -<  {Mg,  eg)  and 
ka\  {Ma,  e)  -<  {Mg,  eg) 

the  corresponding  embeddings  so  that  for  a c b both  in  [yS]<a>, 

K o ja  = je  and  kb  o iab  = ka  . 

It  is  straightforward  to  see  that  (iii)  implies  that  {Mg,  eg)  is  well-founded. 
(By  the  argument  for  15.7(a),  (iii)  in  fact  characterizes  well-foundedness.)  More- 
over, an  argument  as  for  19.3  shows  that  eg  is  set-like.  Flence,  we  can  assume 
(0.4)  that 
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M£  is  transitive  and  e£  = e fl  (Mg  x Mg). 

Extending  a previous  notation,  the  situation  is  summed  up  by 

jg:  N -<  Mg  = Ult(7V,  £)  , 

although  a single  ultrapower  is  not  being  taken.  As  usual,  subscripts  will  be 
suppressed  when  clear  from  the  context. 

(a)  of  the  following  lemma  is  based  on  (iv)  and  shows  why  it  was  called 
normality,  and  (b)  is  based  on  the  subsidiary  conditions  of  (i).  For  n e co, 
let  id,,:  [£]"  -»■  [£]"  be  the  identity  map.  Note  that  for  a c b both  in  [/J]<<B, 

fflZ>([id|<7|]£(J)  = \nba\Eb' 


26.2  Lemma. 

(a)  For  any  a € [f]<a>,  M[id|fl|]£„)  = a. 

(b)  crit(/V)  = k and  C is  the  least  ordinal  satisfying  B < j£(C). 

(c)  M£  = { j£(f)(a ) | a e m<a>  a fe  fl  N}. 

Proof,  (a)  This  is  first  established  for  a e [f]1  by  showing  that  £{a)([idi]£|0))  = {a} 
for  a < f by  induction  on  a.  So,  assume  that  this  holds  for  a < y where  y < ft. 
*{y)([idikM)  = {<5}  for  some  <5,  and  it  must  shown  that  8 = y. 

Consider  first  an  a < y,  and  set  a = {a,  y}.  For  each  s e [£]2,  7ra  („}(.?)  = 
{£q}  and  7ta{y)(s)  = {£js}  where  s = Iff  fj'}  with  §0S  < ff  It  follows  from  the 
definitions  that  the  unique  member  of  &{a}([idi]£M),  which  inductively  is  a,  is  less 
than  the  unique  member  of  klrj([idi]£|)/|),  which  is  8.  Flence,  y < 8. 

For  the  converse,  suppose  that  p < 8,  say  p — ka([f\Ea)  where  a e 
with  yea  and  f e fl  N . By  the  definition  of  8, 

(*)  {5  e | f(s)  is  less  than  the  unique  member  of  jr„{y)(^)}  e Ea  . 

By  condition  (iv)  there  is  a be  [/S]<£u  with  a c b such  that 

| f(7tba(s))es}eEh. 

It  follows  that  for  some  i, 

{s  e [?f 1 | f(jtba(s))  = (*),•}  e E„ 

where  (s)i  temporarily  denotes  the  ith  member  in  increasing  order  of  a set  s of 
ordinals.  If  (h),  = a , then 

| {f(7Tba(s))}  = nbla](s)}eEb. 

But  then,  condition  (ii)  applied  to  (*)  implies  that  Ttb^fs)  < na{a}(tXba(s))  for  s 
in  a set  in  £),,  so  that  a < y.  It  follows  from  the  definitions  and  induction  that 

{p}  = {ka([f]Ea)}  = *{„}([idikM)  = {<*}  . 


Flence  5 < y,  and  so  8 = y. 
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The  result  for  general  a e now  follows  easily:  Suppose  that  p — 

ki,([  f]Eb),  taking  a c b.  Then  as  in  the  above  arguments, 

p e ka([idlal]EJ  iff  [f]E„  e [7Tha]Eb 

iff  {[/]£„}  = [^b{a)]Eb  for  some  a e a 
iff  (pi  = M[idi]£w)  = {«}  for  some  a ea  . 

(b)  To  check  that  crit( j'^r)  = k,  that  jg(a)  — a for  a < k follows  by  a simple 
inductive  argument  based  on  the  “IV-/c-completeness”  of  the  Ea’s,  and  then  (i)(l) 
implies  that  for  some  a,  crit (jg)  = crit (ja)  — k. 

That  p < jg (£ ) follows  from  (a):  For  any  a < ft,  [idi]£|a|  e [7{«} (4T)]1  so 
that  k(„}([idi]£w)  e [jsif)]1,  and  A:{a)([idi]£M)  = {a}  by  (a). 

Finally,  if  f < t,,  then  (i)(2)  implies  that  for  some  a,  ja(f)  e [id|fl| ]Ea  so  that 
js(f)  e a c f by  (a),  and  so  indeed  f is  the  least  ordinal  satisfying  p < jsif). 

(c)  This  is  like  26.1(a):  If  x e Mg,  there  is  an  a e [/6]<w  and  an  f e '“'NON 
such  that  x = ka([f]EJ.  But  [ f]Ea  = ya(/)([id|a|]£fl),  and  so 

x = ka([f]EJ  = ka((ja(fma([idlal\Ea))  = jg(f)(a)  , 

the  last  equality  being  a consequence  of  (a).  H 

The  correlation  of  the  two  kinds  of  extenders  is  now  at  hand: 

26.3  Exercise.  Let  N be  an  inner  model  of  ZFC. 

(a)  Suppose  that  j:  N ■<  M with  crit (j)  = k,  and  £ is  the  (k,  P)-extender 
derived  from  j.  Then  £ is  an  N-(k , P)-extender. 

(b)  Suppose  that  £ — (Ea  \ a e [y6]<<u>  is  an  N-(k , P)-extender.  Then  the 
(at,  P)-extender  derived  from  jg:  N Mg  = Ult(lV,  £)  is  again  £. 

Hint,  (b)  applies  26.2.  H 

The  following  observations  are  as  for  ordinary  ultrapowers  (cf.  5.7).  Re- 
call from  §19  that  for  N a transitive  e-model  of  ZFC'  and  U and  S such  that 
{N,  e,U)  \=  U is  an  ultrafilter  over  S,  ( N , U)  is  weakly  amenable  iff  whenever 
FesNr\N,{ieS  \ F(i ) e U]  e N. 

26.4  Exercise.  Suppose  that  N is  an  inner  model  of  ZFC,  £ = (Ea  \ a e [ P~]<w ) 
is  an  N-(k , P)-extender  with  each  Ea  over  [f]^,  and  j:  N ■<  M = Ult(lV,  £). 
Then: 

(a)  j{x)  — x for  every  x e VK  C\  N.  If  ( N , Ea)  is  weakly  amenable  for  each 
a e [P]<<0,  then  VK  C\  N = VK  C\  M,  V(k)  (T  N = V(k)  (T  M,  and  k+n  = k+m . 

(b)  For  any  f j(f)  < (|^|  Cl  N\  ■ \P\)+- 

(c)  If  0 > P is  a cardinal  and  N |=  f is  a strong  limit  of  cofinality  greater 
than  if,  then  j{6)  — 0. 

(d)  For  any  set  X such  that  |X|  > f j“X  f M. 

(e)  £ <f  M. 
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Hint,  (a)  is  like  19.1(c),  (b)  like  19.7(a)  because  of  26.2(c),  (c)  like  19.7(c),  (d) 
like  22.4(c),  and  (e)  like  22.4(d).  H 

The  following  simple  observation  precludes  the  possibility  of  target  models 
for  extender  embeddings  having  global  closure  properties  in  general. 

26.5  Exercise.  Suppose  that  k < X,  X is  a strong  limit  cardinal  of  cofinality  u>,  and 
there  is  a j:  V -<  M with  crit(j)  = k and  V>_  C M.  Let  £ be  the  (k,  Id-extender 
derived  from  j and  jg:  V < Mg  = Ult(V,  £).  Then  wMg  g Mg. 

Hint.  V)  C Mg  by  26.1(b),  and  so  if  k < /J  < X,  the  (k,  /l)-extender  £ |[/J]<a)  e 
Mg.  But  £ ^ Mg  by  26.4(e).  H 

Strong,  Woodin,  and  Superstrong  Cardinals 

While  extenders  may  not  generate  inner  models  closed  under  the  taking  of  arbitrary 
co-sequences,  they  suggest  appropriately  tailored  hypotheses: 

k is  y -strong  iff  there  is  a j:  V < M such  that 

(a)  crit(y')  = k and  y < j (k)  , and 

(b)  VK+y  CM. 

k is  strong  iff  k is  y -strong  for  every  y . 
k is  superstrong  iff  there  is  a j:  V < M with 
crit(y ) = k and  Vj(K)  c M . 

Thus,  if  y < 8 and  k is  8-stivng,  then  k is  y-strong ; and  k is  measurable  iff  k 
is  0-strong  iff  k is  1-strong.  These  new  hypotheses  are  clearly  weak  analogues 
of  previous  ones:  if  k is  supercompact,  then  k is  strong ; and  if  k is  huge,  then 
k is  superstrong.  And  as  before,  although  the  3 j ranging  over  classes  is  not 
directly  formalizable,  there  will  be  characterizations  that  provide  a posteriori  for- 
malizations. By  these  means  the  superstrongness  of  k will  actually  be  seen  to 
be  consistency-wise  weaker  than  its  2K -supercompactness  (26.11(b)).  Gaifman 
[74:  85ff]  had  considered  strong  and  superstrong  cardinals,  but  they  did  not  be- 
come a focus  of  attention  until  Mitchell  [79]  and  Dodd  and  Jensen  (see  Dodd  [82]) 
showed  that  strong  cardinals  have  a canonical  inner  model  theory.  Concerning 
our  definition  of  y-strong,  the  specific  form  of  (b)  is  pertinent  only  for  small  y, 
since  y > k ■ co  implies  that  tc  + y = y,  and  has  some  nominal  advantages.  Also 
the  condition  y < j(K)  is  included  for  convenience  as  for  y -supercompactness;  it 
is  superfluous  when  y is  a successor  and  hence  for  full  strongness.  (See  26.7(b), 
which  gives  Gaifman’s  original  formulation;  if  j(i<)  < y,  then  k is  superstrong, 
a consistency-wise  stronger  assertion  than  being  strong  (26.12,26.13).)  Note  that 
y < yfic)  implies  that  |iy+y|  < j (k):  since  VK+Y  = V^_y  and  j(/c)  is  inaccessible 
in  M,  \VK+Y\  < |V„-+y|M  < j(K). 

Strongness  suffices  for  drawing  former  conclusions  deducible  with  reflection 
arguments  depending  only  on  having  sufficiently  large  Va  included  in  the  target 
model: 
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26.6  Exercise. 

(a)  If  there  is  a strong  cardinal,  then  V L{A)  for  any  set  A. 

(b)  If  k is  2-strong,  there  is  a normal  ultrafilter  U over  k such  that 

{a  < k | a is  measurable}  e U . 

(c)  If  k is  y -strong  and  2“  = a+  for  every  a < k,  then  2 P — fi+  for  every 
P < k + y. 

(d)  If  k is  strong,  then  VK  -<2  V. 

Hint.  For  (a),  assume  to  the  contrary  that  V — L(A ) for  set  A.  Let  j:  V < M 
with  crit(y)  = k and  A e M.  Then  M — L(A ) since  M is  an  inner  model  of 
L(A)  with  A e M.  But  then  j:  V -<  V,  contradicting  Kunen’s  23.12. 

(b)  follows  from  the  argument  for  22.1,  (c)  from  that  for  22.2,  and  (d)  from 
that  for  22.3.  H 

(a)  recalls  5.9,  but  the  relationship  between  strong  compactness  and  strong- 
ness is  unclear. 

These  observations  may  fuel  the  speculation  that  strongness  can  supplant 
supercompactness,  but  there  are  important  relative  consistency  results  based  on 
the  latter  for  which  the  former  does  not  seem  to  suffice. 

The  following  characterizations  are  in  the  spirit  of  the  ultrafilter  characteri- 
zation of  y -supercompactness: 

26.7  Exercise. 

(a)  k is  y -strong  iff  there  is  a (k,  \ VK+Y\+)-extender  £ such  that  VK+Y  C Mg 
and  y < js(K). 

(b)  k is  (y  + 1 )-strong  iff  there  is  a (k,  f)-extender  £ for  some  f > k such 
that  VK+Y+ 1 c Mg.  Consequently,  tc  is  strong  iff  for  any  set  x there  is  a j\  V -<  M 
with  crit(y)  = k and  x C M. 

(c)  k is  superstrong  iff  there  is  a ( k , f)-extender  £ for  some  f > k such  that 
VjsM  ^ Mg. 

Hint.  Apply  26.3(b)  and  26.1(b)(c).  For  (a),  note  that  if  j:  V -<  M witnesses 
the  y-strongness  of  k,  then  \VK+Y\  < \VK+Y\M  < |V^+y|+.  For  (b),  argue  as  for 
23.15(a)  that  the  condition  y < jg(K)  can  be  adjoined.  H 

((b)  can  be  slightly  strengthened  with  occurrences  of  y + 1 replaced  by  y, 
by  further  developing  the  theory  of  iterated  embeddings  broached  in  the  proof  of 
23.15(a)  to  get  a jm+x:  V < Mm+\  with  y < jw+l(K).) 

These  characterizations  lead  to  formalizations  in  ZFC,  since  clauses  of  the 
sort  Va  C Mg  can  be  rendered  in  terms  of  the  “extender  ultrapower”  of  a suf- 
ficiently large  Vp.  Such  clauses  can  be  further  embedded  by  formulating  the 
concept  of  a ( k , F)-extender  for  arbitrary  transitive  sets  Y:  Simply  replace  the  f 
in  the  definition  of  ( k , ft) -extender  by  Y,  replacing  t;  by  and  making  notational 
changes  to  accommodate  Y not  necessarily  having  a natural  well-ordering.  The 


360 


Chapter  5.  Strong  Hypotheses 


argument  for  26.2(a)  then  shows  that  Y is  included  in  the  target  model  Mg  of  the 
resulting  extender  embedding,  so  that  Va  c Mg  becomes  automatic  if  Va  c Y . 
Such  extenders  were  used  in  Martin-Steel  [89].  However,  the  simpler  Y = ft 
extenders  are  appropriate  for  the  inner  model  theory  because  of  the  absoluteness 
of  the  corresponding  f,  and  hence  of  the  underlying  [£]|fl|’s  for  the  Ea’s. 

The  following  analogue  of  22.8(b)  is  a consequence  of  26.7(a): 

26.8  Exercise.  If  Con(ZFC  + 3 k(k  is  strong)),  then  Con(ZFC  + 3k  (k  is  strong 

A -dkfA.  > k A A.  is  inaccessible))).  H 

In  contrast,  the  argument  from  1 -extendibility  given  after  23.3  shows  that 
there  are  many  measurable  cardinals  above  a superstrong  cardinal. 

We  can  deduce  as  for  y -supercompactness  that  f is  y -strong1  is  AfF  (cf.  be- 
fore 22.9),  and  as  for  hugeness  that  f is  superstrong1  is  H'fF.  This  leads  to  a 
direct  analogue  of  24.10: 

26.9  Exercise.  If  k is  strong  and  there  is  a superstrong  cardinal  above  it,  then 

there  are  k superstrong  cardinals  below  k.  In  particular,  if  there  are  strong  and 
superstrong  cardinals,  then  the  least  superstrong  cardinal  is  less  than  the  least 
strong  cardinal.  ~\ 

This  has  no  bearing  on  the  consistency  strength  of  superstrongness,  which  is 
stronger  than  that  of  strongness.  Before  establishing  this,  a further  hypothesis  is 
tucked  in: 

k is  Woodin  iff  for  any  f eKK,  there  is  an  a < k 

with  f“a  C a and  a j:  V < M with 
crit(y)  = a and  Vj(f)(a)  C M . 

This  concept  was  isolated  in  1984,  and  through  a series  of  remarkable  advances 
came  to  play  a central  role  in  consistency  analysis  (§32).  A more  technical  hy- 
pothesis, its  full  significance  can  only  be  appreciated  after  an  extended  exegesis. 
Note  that  as  for  26.7,  being  Woodin  is  characterizable  in  terms  of  the  existence  of 
extenders',  a refined  form  of  this  will  be  established  below  (26.14). 

26.10  Exercise.  Suppose  that  k is  Woodin.  Then  k is  regular  and  {a  < k \ a is 
measurable}  is  stationary  in  k,  and  so  k is  K-Mahlo. 

Hint.  For  the  regularity  of  k,  note  that  if  k were  singular  and  / e kk  were  such 
that  (/(|)  | £ < cf(/c)}  is  increasing  and  cofinal  in  k with  /( 0)  > cf (k),  there 
would  be  no  a < k such  that  /“a  C a.  H 

This  section  is  brought  to  a close  by  establishing  the  hierarchical  relationships 
among  the  new  hypotheses  and  providing  useful  characterizations  of  being  Woodin 
in  the  process. 
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26.11  Proposition. 

(a)  If  k is  l-extendible,  then  k is  superstrong,  and  there  is  a normal  ultrafilter 
U over  k such  that 

{a  < k | a is  superstrong}  e U . 

(b)  If  k is  2K  -supercompact,  then  there  is  a normal  ultrafilter  over  k such  that 

{a  < k | a is  superstrong}  e U . 

Proof,  (a)  Suppose  that  j : VK+\  -<  VpK)+ 1 witnesses  the  1-extendibility  of  k. 
Let  £ be  “the  ( k , j (k)) -extender  derived  from  j we  had  only  defined  extenders 
derived  from  embeddings  when  they  are  between  inner  models,  but  in  this  case 
we  can  still  define  the  Ef  s,  as  Pfl/c]1"1)  c VK+\.  It  is  simple  to  check  that  £ is 
indeed  a (at,  j (k)) -extender,  and  so  has  a corresponding  jg:  V -<  Mg  = Ult( V,  £). 
If  g e VK+\  is  a bijection:  [at]1  —>  VK,  then  j(g)  e VpK)+\  is  a bijection: 
[7  (at)]  1 ->  Vj(K),  so  that  the  argument  for  26.1(b)  shows  that  Vj(K)  C M.  Also, 
the  argument  for  26.1(c)  shows  that  jg(K)  — j(K).  Hence,  k is  superstrong  by 
the  characterization  26.7(c).  Finally,  £ e Vj(K)+ 1,  and  so  if  U is  the  normal 
ultrafilter  defined  from  j as  usual,  then  again  by  26.7(c),  mindful  of  how  the 
condition  Vj£  (K)  C Mg  can  be  expressed  in  terms  of  the  “extender  ultrapower”  of 
K , {a  < k | a is  superstrong}  e U. 

(b)  This  follows  from  (a)  and  23.5  for  r]  = 1.  H 

(b)  is  rather  striking,  in  that  a j:  V -<  M with  the  closure  condition  2"nUj'M  C 
M implies  the  consistency  of  the  existence  of  a j'\  V < M'  with  the  closure 
condition  V,'(Critov))  — M1;  /(crit(/))  is  only  determined  a posteriori  by  j'  and  is 
much  larger  than  crit(/),  so  that  a large  initial  segment  of  the  universe  is  absolute 
for  M' . This  serves  to  mitigate  against  a heuristic  argument  given  for  the  strength 
of  n -hugeness  just  after  that  concept  was  defined,  for  the  case  n — 1. 

In  contrast,  a real  strengthening  is  to  go  to  an  (n  + l)-huge  analogue  of 
superstrongness:  There  is  a j:  V < M with  crit(y')  = k and  Vj*+i(K)  C M.  This 
proposition  implies  the  n -hugeness  of  k by  the  argument  for  24.8. 

26.12  Proposition.  If  k is  superstrong,  then  k is  Woodin  and  there  is  a normal 
ultrafilter  U over  k such  that 

{a  < k | a is  Woodin}  e U . 

Proof.  Suppose  that  j:  V -<  M witnesses  the  superstrongness  of  k.  Towards 
verifying  that  k is  Woodin,  let  / e kk.  Then  j(f)  e ^K)j(K),  and  j (f)“K  — 
f“K  c k.  Let  £ be  the  (k,  |V/(/)(K)|M)-extender  derived  from  j.  Since  j (k)  is 
inaccessible  in  M and  j(f)(K)  < j(K),  it  follows  that  |V}(/)(k)|m  < j (k),  and  so 
it  is  simple  to  see  that  £ e Vj(K)  c M.  Moreover,  by  26.1(b),  Vj(f)(K)  c Mg  and 
j(f)  = kg(jg(f))  = jg(f),  so  that  VjE(f)(K)  c Mg.  It  follows  by  elementarity 
that  there  is  an  a < k with  f“a  c a and  an  (a,  j3)-ex tender  £’  e VK  for  some  £> 
such  that  Vje,(f)(c,)  c Mg'.  Hence,  k is  Woodin. 
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The  rest  of  the  proposition  now  follows  by  the  usual  reflection  argument, 
since  the  existence  of  the  requisite  extenders  in  VK  c M shows  that  M |=  k is 
Woodin.  H 

26.13  Proposition.  If  k is  Woodin,  then 

{a  < k | a is  /-strong  for  every  y < k] 


is  stationary  in  k. 

Proof.  Suppose  that  k is  Woodin,  and  C closed  unbounded  in  k.  A member  of  C 
must  be  found  /-strong  for  every  / < k. 

Remembering  that  k is  inaccessible  by  26.10,  define  g e kk  by: 


*(§)  = 


0 if  § is  /-strong  for  every  y < k , else 

/ where  / is  the  least  strong  limit  cardinal 
less  than  k such  that  £ is  not  /-strong  . 


Then  define  / e kk  by: 


/(?)  = ma x({g($)  + 5,  min(C  - $)})  . 


By  hypothesis,  there  is  an  a < k with  f“a  C a and  a j:  V < M with  crit(j)  = a 
and  Vj(  f)(a)  C M.  Now  j(C)  is  closed  unbounded  in  j{k:)\  j(C)C\a  — CHa;  and 
f“a  c a implies  that  C fl  a is  closed  unbounded  in  a.  Consequently,  a e j (C ). 
To  complete  the  proof,  it  suffices  by  elementarily  to  show  that  M \=  a is  /-strong 
for  every  / < j(K ): 

Let 

£=(Ea  | a e [j(g)(a)  + 1]""} 

be  the  (a,  j(g)(a)  + l)-extender  derived  from  j . By  elementarily, 


j(g)(a)  < j(f)(a)  < j (a)  , 

the  latter  inequality  by  /“a  C a and  a < j(a).  Consequently,  each  Ea  is  over 
[a]|a|,  and  j(g)(a)  < jsia)  by  26.1(c)  (in  its  terms,  crit(^£)  > j(g)(a)  + 1).  Also, 
j(g)(a)  is  a strong  limit  in  M and  Vj(fya)  C M,  so  that  by  26. 1(b), Vj(gya)  c Mg. 
Finally,  rank(£)  < j (g)(a)+5,  so  that  £ e V ,(/)(a)  c M.  Moreover,  the  foregoing 
properties  of  £ also  hold  in  the  sense  of  M: 

£ is  an  extender  in  M (note  that  the  functions  needed  for  the  well-foundedness 
condition  are  in  C M),  and  M -<  M f = Ult(M,  £)  is  a class  of  M 

definable  from  £.  jg(a)  is  the  ordertype  of 

{kaiUhJ  I fl  e [j(g)(a)  + 1]<W  A /:  [u]|fl|  ->  a} 

and  V^+\  = Va+i,  and  so  it  is  straightforward  to  check  that  the  computation 
of  jg(a)  is  absolute  for  M,  i.e.  jg  (a)  — jsia),  and  consequently  j(g)(a)  < 
jg(a).  Finally,  a similar  argument  about  the  “extender  ultrapower”  of  Va  shows 
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that  jg  (Va)  — js(Va),  and  so  the  consequence  Vj(g)(a)  — Vj^(a)  — js(Va)  of 
j(g)(a ) < js(a ) implies  that  V^)(B)  = Vj(gm  c M". 

All  this  confirms  that  M |=  a is  j (g) (a) -strong,  and  so  by  the  definition  of 
g,  M \=  a is  y-strong  for  every  y < j(ic).  H 

When  k is  inaccessible  and  f < k,  then  through  the  use  of  extenders  as  in 
the  above  proof,  f is  y -strong  for  every  y < k iff  VK  |=  £ is  strong.  Hence, 

26.13  establishes  transcendence  of  Woodin  cardinals  over  strong  cardinals.  An 
extension  of  its  argument  leads  to  a useful  characterization  of  Woodinness  in 
terms  of  a “boldface”  version  of  strongness.  For  any  set  A, 

k is  y -strong  for  A iff  there  is  a j:  V ^ M such  that 

(a)  crit(y')  = k and  y < j{K ) , 

(b)  VK+y  c M , and 

(c)  A n VK+y  = j (A)  n VK+Y  . 

There  is  an  obvious  analogue  of  26.7(a)  characterizing  y-strongness  for  A in  terms 
of  the  existence  of  extenders. 

26.14  Theorem  (Woodin).  The  following  are  equivalent: 

(a)  k is  Woodin. 

(b)  For  any  A C VK, 

{a  < k | a is  y-strong  for  A for  every  y < k} 


is  stationary  in  k. 

(c)  For  any  A C VK,  there  is  an  a < k y-strong  for  A for  every  y < k. 

(d)  For  any  f e kk  there  is  an  a < k with  f“a  C a and  an  extender  £ e VK 
with  crit (jg)  = a,  js(f)(oi ) = /(a),  and  VjeifKoi)  ^ Ms- 


Proof  (a)  — > (b)  k is  inaccessible  by  26.10.  Following  the  proof  of  26.13,  suppose 
now  that  k is  Woodin,  A c VK,  and  C is  closed  unbounded  in  k . Define  g e kk 
by: 

( 0 if  f is  y-strong  for  A for  every  y < k , else 


«(£)  = 


y where  y is  the  least  strong  limit  cardinal 

less  than  k such  that  § is  not  y-strong  for  A . 


Then  define  / e kk  by 


/«)  = max({g(|)  + 5,  min(C  - f )})  . 


By  hypothesis,  there  is  an  a < k with  /“a  C a and  a j : V -<  M with  crit(j')  = a 
and  Vj(f)( a)  c M.  As  before,  a e j(C),  and  it  suffices  to  show  that  M \=  a is 
y-strong  for  j (A)  for  every  y < J(KY- 

Again  let  £ be  the  (a,  j(g)(a)  + l)-extender  derived  from  j.  Then  the 
previous  results  about  £ and  M -<  Mg  = Ult (M,  £)  will  complete  the  proof, 
once  we  establish  the  further  property 
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mnv&m  = j?(j(A))  nv*)M. 

To  do  this,  it  suffices  to  show  that 

j(A)  n vjfa)  = j?U(A))  n jg(va) 

since  as  observed  before,  j(g)(a)  < j(a),  j(g)(a)  < jg(a),  and  Vf^)(a)  = 
Vj(g)(a)  c Mg.  But  note  that 

j(A ) n vfia)  = j(A  nva)  = je(A  n va)  = jg  (A  n va) , 

the  second  equality  from  26.1(b)  and  the  third,  by  an  absoluteness  argument  as 
before  since  Vg+X  = Va+i.  Furthermore,  A fl  Va  = j(A)  fl  Va  as  crit(y')  = a,  and 
so  the  chain  of  equalities  can  be  continued  with 

jg  (A  n va)  = jg(j(A ) n Va)  = jg(j  (A))  n jg(va) 

completing  the  proof. 

(b)  -*  (c)  This  is  immediate. 

(c)  — »■  (d)  Suppose  first  that  g e kk.  Then  by  hypothesis  there  is  an  a e k y- 
strong  for  jCV,  for  every  y < k.  This  implies  that  g“a  C a: 

For  f e k,  set  mg(%)  = max({^,  )})  + 3.  Suppose  now  that  £ < a.  Let 

j:  V < M with  crit(y)  = a,  g(£)  < j(a),  and  g n Vmg(S)  = j(g ) (T  Fm,,(r).  Since 
<?,#(£))  C g n Vmg(H),  it  follows  that  y'(g)(f)  = g($)  < j(u),  and  hence  that 
g($)  < «• 

The  argument  for  26.10  using  various  g’s  now  shows  that  k must  be  inac- 
cessible. 

To  complete  the  proof,  suppose  that  / e kk.  As  before,  let  a < k bey- 
strong  for  / C VK  for  every  y < k,  so  that  in  particular  /“a  c a.  As  we  now 
know  that  k is  inaccessible,  by  characterizability  through  extenders  (cf.  26.7(a)) 
there  is  an  extender  £ e VK  such  that  crit(y£)  = a,  Vf(a)  C Mg,  and  / IT  Vmf(a)  = 
Vmf(a)  (wherem/(a)  is  defined  as  in  the  second  paragraph).  But  as  before, 
je (/)(“)  = /(a),  and  so  the  result  follows. 

(d)  ->  (a)  This  is  immediate.  H 

Woodin  cardinals  are  best  viewed  in  terms  of  (c),  which  captures  their  essence 
as  a kind  of  reflection  property,  (d)  shows  that  k being  Woodin  is  a Fl\  property 
of  (VK,  e),  and  so  the  least  Woodin  cardinal  is  not  weakly  compact. 

26.14(c)  is  similar  to  the  characterizations  24.19  of  Vopenka’s  Principle, 
and  this  promotes  the  analogy  that  Woodin  cardinals  are  to  strong  cardinals  as 
Vopenka’s  Principle  is  to  supercompact  cardinals.  As  with  that  principle,  Woodin 
cardinals  can  be  further  analyzed  in  terms  of  corresponding  filters:  For  X C.  k, 

X is  Woodin  in  k iff  for  any  / e kk,  there  is  an  a e X 

with  f“a  C a and  a j:  V ~<  M with 
crit(y)  = a and  Vj(  f)(a)  C M . 
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It  is  simple  to  check  that  k is  Woodin  iff 

{X  C k | k — X is  not  Woodin  in  /c} 

is  a (proper)  filter,  the  Woodin  filter  over  k. 

The  following  propositions  are  analogous  to  those  for  Vopenka’s  Principle, 
but  the  proofs  are  less  straightforward  at  some  points. 

26.15  Exercise.  Suppose  that  k is  Woodin,  and  F is  the  Woodin  filter  over  k. 

(a)  F is  normal. 

(b)  For  any  A C.  VK,  {a  < k \ a is  y-strong  for  A for  every  y < k]  e F. 

(c)  If  X e F,  {a  < k \ a is  measurable  and  there  is  a normal  ultrafilter  over 
a containing  X IT  a}  e F. 

Hint,  (a)  recalls  24.14.  For  (b),  argue  as  for  26.14(a)  ->  (b),  but  with  a j that 
also  satisfies  g(a)  — j(g)(a);  this  is  possible  by  the  argument  for  26.14(c)  -> 
(d).  (c)  recalls  24.16  and  uses  26.14(c).  H 

In  the  context  of  inner  model  theory  Schimmerling  [02]  provided  a further 
characterization  of  Woodin  cardinals  and  went  on  to  formulate  the  hyperWoodin 
cardinals  and  the  weakly  hyperWoodin  cardinals.  These  cardinals  fit  into  the 
large  cardinal  hierarchy  in  the  following  increasing  order  of  strength:  measurable 
Woodin,  weakly  hyperWoodin,  hyperWoodin,  and  superstrong.  A cardinal  k is 
hyperWoodin  iff  it  is  Woodin  and  there  is  a normal  ultrafilter  over  k extending 
the  Woodin  filter. 

With  this  look  at  extenders  and  related  hypotheses  we  have  peered  ahead 
at  trappings  of  major  advances  of  the  1980’s.  These  were  made  over  a broad 
front:  dramatic  progress  in  inner  model  theory  in  conjunction  with  crowning 
achievements  in  the  concerted  investigation  of  a hypothesis  that  arose  in  the  1960’s 
from  an  entirely  different  quarter,  the  Axiom  of  Determinacy. 


Chapter  6 

Determinacy 


The  investigation  of  the  determinacy  of  games  is  perhaps  the  most  distinctive 
and  intriguing  development  of  modern  set  theory,  and  the  correlations  eventually 
established  with  large  cardinals  the  most  remarkable  and  synthetic.  Although 
focused  on  sets  of  reals,  the  subject  was  to  expand  across  the  breadth  of  set  theory 
from  combinatorics  and  forcing  to  large  cardinals  and  inner  model  theory.  As  a 
topical  and  anticipatory  conclusion  to  the  present  volume  this  chapter  describes 
this  development.  §27  discusses  the  historical  beginnings  of  the  study  of  infinite 
games  and  the  early  work  that  led  to  the  formulation  of  determinacy  hypotheses. 
§28  starts  with  Solovay’s  seminal  result  on  the  connection  with  large  cardinals  and 
proceeds  to  develop  the  combinatorial  theory  in  that  direction.  §29  and  §30  explore 
the  structural  consequences  of  determinacy  in  descriptive  set  theory,  a direction  of 
investigation  first  pursued  by  Moschovakis  and  Martin.  §31  starts  the  discussion 
the  consistency  of  determinacy  hypotheses,  with  Martin’s  groundbreaking  work. 
Finally,  §32  gives  a panoramic  survey  of  recent  relative  consistency  results  of 
Martin,  Steel,  and  especially  Woodin. 
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27.  Infinite  Games 

With  roots  in  the  analysis  of  actual  games,  the  concepts  of  infinite  game,  winning 
strategy,  and  determinacy  (i.e.  having  a winning  strategy)  have  become  a rich 
paradigm  for  the  articulation  of  dichotomies  across  the  breadth  of  set  theory.  In 
light  of  the  first  results  along  these  lines,  Polish  mathematicians  in  1962  introduced 
the  Axiom  of  Determinacy  (AD),  postulating  for  every  set  of  reals  the  determinacy 
of  a corresponding  game,  largely  because  it  established  the  regularity  properties 
(§12)  for  all  sets  of  reals.  With  the  conclusion  contradicting  the  Axiom  of  Choice, 
AD  was  initially  cast  adrift  in  the  sea-change  of  set  theory  brought  on  by  the 
development  of  forcing  and  the  gathering  results  about  large  cardinal  hypotheses. 
On  the  other  hand,  after  Solovay  established  in  1965  that  the  regularity  properties 
for  the  Ej  sets  follow  from  the  existence  of  a measurable  cardinal  (§14),  there  was 
a predisposition  to  consider  the  liberal  use  of  strong  hypotheses  to  settle  questions 
of  descriptive  set  theory. 

Two  events  in  1967  stimulated  the  further  research  that  soon  transformed 
the  investigation  of  determinacy  into  a mainstream  of  set  theory.  The  first  was 
Solovay’s  demonstration  that  AD  implies  the  measurability  of  to i,  which  anchored 
unexpected  consequences  for  the  transfinite  in  the  emerging  theory  of  large  cardi- 
nals. The  second  was  the  appearance  of  a new  and  elegant  game  proof  by  David 
Blackwell  [67]  of  a result  of  classical  descriptive  set  theory,  which  secured  a 
mooring  for  a systematic  approach  to  extending  the  methods  and  results  of  that 
theory  based  on  determinacy. 

Acknowledging  an  apparently  strong  hypothesis  based  on  a new  paradigm 
and  attracted  by  the  possibility  of  developing  more  consequent  structure  about 
the  transfinite  cardinals  and  definable  sets  of  reals,  a host  of  fine  mathematicians 
soon  came  on  board  and  carried  out  a concerted  effort:  Solovay,  Martin,  and 
Moschovakis,  joined  by  Kunen  and  Kechris  by  the  early  1970’s.  A wide  array 
of  techniques  ranging  from  the  Recursion  Theorem  to  ultrapowers  was  brought 
to  bear  to  systematically  substantiate  an  often  strange  and  striking  but  internally 
coherent  web  of  connections  imposed  by  AD  and  other  determinacy  hypotheses. 
That  these  hypotheses  had  no  particular  justification  was  widely  acknowledged,  but 
their  consequences  as  well  as  the  modes  of  argument,  particularly  in  descriptive 
set  theory,  were  thought  to  be  considerable  motivation  for  their  further  study. 

By  the  later  1970’s  a more  or  less  complete  theory  for  the  projective  sets  was 
in  place,  a resilient  edifice  founded  on  determinacy  with  both  strong  buttresses 
and  fine  details.  (There  was  one  notable  lacuna,  the  determination  of  the  so-called 
projective  ordinals.  This  was  to  be  carried  out  in  a culminating  feat  of  technical 
virtuosity  by  Steve  Jackson.)  With  the  study  per  se  thus  vindicated  the  researchers 
started  the  Cabal  Seminar  in  the  Los  Angeles  area,  and  particularly  with  two  fine 
younger  mathematicians,  Steel  and  Woodin,  joining  the  ranks  of  that  seminar, 
attention  began  to  shift  to  sets  of  reals  beyond  the  projective  sets,  to  inner  models, 
and  to  questions  of  overall  consistency. 
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From  the  beginning,  the  investigation  of  determinacy  not  only  involved  its 
consequences,  but  how  much  determinacy  is  derivable.  Early  on  Martin  had 
established  another  synthetic  connection  between  determinacy  and  large  cardinals 
by  showing  that  the  existence  of  a measurable  cardinal  implies  a significant  amount 
of  determinacy.  In  1974,  complementing  early  work  of  Harvey  Friedman,  Martin 
essentially  established  just  how  much  determinacy  is  derivable  in  ZFC,  and  several 
years  later,  building  on  work  of  Solovay  and  Harrington  he  established  an  exact 
correlation  between  amounts  of  determinacy  and  the  existence  of  many  measurable 
cardinals. 

With  its  seemingly  boundless  strength,  does  determinacy  subsume  the  large 
cardinal  hierarchy?  Or  are  all  the  determinacy  hypotheses  derivable,  after  all,  from 
large  cardinals?  Are  the  two  somehow  orthogonal  in  the  higher  reaches?  How 
this  central  problem  was  eventually  resolved  in  the  later  1980’s  has  elements  of 
an  exciting  mystery  novel:  the  spurts  and  lulls,  the  red  herrings  and  sudden  leads, 
the  long-distance  phone  calls,  and  once  the  parts  of  the  puzzle  began  to  fit  into 
place,  the  orderly  denouement.  Large  cardinals,  and  relatively  modest  ones  at  that, 
do  provide  the  requisite  strength  as  shown  by  Martin  and  Steel,  and  ultimately  by 
Woodin,  in  remarkable  equiconsistency  results.  This  was  a resounding  triumph  for 
the  modem  methods  of  set  theory  and  an  unforseen  affirmation  of  the  relevance 
of  large  cardinals. 

This  overview  has  gone  far  beyond  our  historical  frame,  and  to  provide  an 
adequate  account  of  the  foregoing  would  go  far  beyond  the  scope  of  the  present 
volume.  The  approach  that  will  be  taken  is  as  follows:  This  section  describes 
the  historical  beginnings  of  determinacy  and  the  early  work  that  led  to  the  for- 
mulation of  determinacy  hypotheses.  The  next  pursues  the  more  combinatorial 
consequences  of  determinacy,  particularly  those  of  large  cardinal  character.  §29 
and  §30  explore  the  structural  consequences  of  determinacy  hypotheses  in  descrip- 
tive set  theory.  §31  discusses  the  early  work  on  the  consistency  of  determinacy 
hypotheses,  and  finally  §32  surveys  the  crowning  achievements  of  the  1980’s  in 
this  direction.  By  then,  most  proofs  will  have  been  dispensed  with  in  favor  of  a 
coherent  account  of  the  overall  context  and  the  historical  progression  of  ideas.  The 
text  Moschovakis  [80]  serves  as  the  reference  for  the  decriptive  set  theory  part,  and 
§§28, 3 1, 32  play  roles  complementary  to  it,  the  first  in  emphasizing  combinatorics 
and  large  cardinals  and  the  latter  in  focusing  on  relative  consistency. 

As  much  of  the  work  in  determinacy  must  proceed  without  AC, 

ZF  serves  as  the  ambient  theory  for  this  section  , 

and  uses  of  AC  will  be  explicitly  noted,  reversing  the  usual  procedure. 

To  establish  the  basic  context,  let  A be  a non-empty  set.  For  A c "A,  GX(A) 
denotes  the  following  “infinite  two-person  game  with  perfect  information”:  There 
are  two  players,  I and  II.  I initially  chooses  an  x(0)  e A;  then  II  chooses  an 
x(l)  e A;  then  I chooses  an  x(2)  e A;  then  II  chooses  an  x(3)  e A;  and  so 
forth: 
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I : x(0 ) x(2) 

II  : x(l)  x(3) 

Each  choice  is  a move  of  the  game,  and  each  player  before  making  each  of  his 
moves  is  privy  to  all  the  previous  moves  (“perfect  information”).  The  resulting 
x e ® X is  a play  of  the  game,  an  initial  segment  of  x a partial  play,  and  I wins 
if  x e A,  and  otherwise  II  wins.  A is  the  payoff  for  the  game  Gx(A). 

A strategy  for  I is  a function  a:  [Jn€lo2r,X  — > X that  tells  him  what  move 
to  make  given  the  previous  moves,  so  that  a (partial)  play  according  to  c r is  a 
(partial)  play  of  form 

/:  er(0)  or«er(0),  v (0)))  <x«<r(0),  y(0),  <r«<r(0),  y(0)»,  y(l)» 

II-  y( 0)  yd)  ... 

With  IPs  moves  thus  enumerated  by  y e WX,  this  play  is  denoted 

<7  * y . 

a is  a winning  strategy’  for  I iff  {a  *y  \ y e “I)  C A,  i.e.  no  matter  what  moves 
II  makes,  plays  according  to  a always  result  in  a member  of  A.  Analogously,  a 
strategy  for  II  is  a function  r:  {J„€m2n+xX  — > X;  and  a (partial)  play  according 
to  x is  a (partial)  play  of  form 

/ : z(0)  z(l)  ... 

II:  r«z(  0)»  t((z(0),t((z(0)»,z(1)» 

for  some  z e WX,  denoted 

z * r . 

r is  a winning  strategy’  for  II  iff  {z*  r \ z e “X)  fl  A = 0.  Note  that  for  any 
strategy  a for  I,  the  function  assigning  y e mX  to  a * y e “ X is  injective,  so  that 
if  | A | < \X\M,  then  I cannot  have  a winning  strategy;  analogous  remarks  apply  to 
II  and  \mX  — A|.  Finally, 

Gx(A)  is  determined  iff  a player  has  a winning  strategy  . 

Note  that  the  players  cannot  both  have  winning  strategies.  For  brevity  in  the  main 
case  of  X — a>  and  A c aco, 

A is  determined  iff  Gm(A)  is  determined  . 

The  following  notation  is  helpful  in  this  setting:  For  x e WX,  define  xj,xu  e 
C0X  by 

xj(i)  — x(2 i)  , and 
Xn(i)  — x(2 i + 1)  . 

If  a is  a strategy  for  I as  above  and  y e 10 X,  then  (cr  * y)//  = y,  and  if  r is  a 
strategy  for  II  and  z e WX,  then  (z*t)i  = z. 

The  games  Gx(A)  are  taken  to  be  paradigmatic,  but  various  games  will 
also  be  considered  with  more  involved  rules.  The  foregoing  terminology  adapts 
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naturally,  and  will  be  applied  without  further  explanation.  Sometimes  a “payoff 
scheme”  is  described  without  making  explicit  a corresponding  set  A,  but  this  will 
be  simple  to  do  in  each  case. 

The  determinacy  of  games  having  emerged  in  modem  set  theory  as  a major 
area  of  investigation,  it  is  notable  that  important  figures  in  foundational  studies  first 
entertained  the  possibility  of  an  analysis  of  extant  games  and  game-like  contexts. 
In  an  early  adumbration  focusing  on  chess  Zermelo  [13:502-503]  argued  that  if 
from  a position  q a player  can  force  a win  at  all,  then  there  is  a t (q)  e w such  that 
he  can  win  in  at  most  t(q)  moves  no  matter  how  his  opponent  plays.  This  was 
later  taken  up  by  Hungarian  mathematicians:  In  the  paper  that  introduced  his  well- 
known  tree  lemma  Konig  [27]  filled  a gap  in  Zermelo’s  argument  using  that  lemma, 
following  a suggestion  of  von  Neumann.  At  the  end  Konig  [27:  130]  gave  a simple 
argument  communicated  to  him  by  Zermelo  that  also  filled  gap;  Zermelo  was 
arguing  in  effect  that  finite  versions  of  the  games  Gx(A)  are  determined.  Konig 
considered  an  infinite  chessboard,  and  Laszlo  Kalmar  [28]  pursued  a transfinite 
generalization. 

Before  this,  Borel  [21]  had  described  a wide  range  of  finitary  game-theoretic 
problems,  introduced  a general  concept  of  strategy,  and  formulated  the  now  fa- 
miliar minimax  strategy.  Then  von  Neumann  [28]  proved  the  crucial  minimax 
theorem,  the  result  that  really  began  the  mathematical  theory  of  games.  A simple 
consequence  of  that  theorem  is  again  that  finite  versions  of  the  games  Gx  (A)  are 
determined.  Focusing  on  chess  Max  Euwe  [29]  had  also  described  the  minimax 
strategy;  he  was,  incidentally,  the  world  champion  of  chess,  1935-1937. 

Speculating  about  possible  consequences  for  sets  of  reals  Polish  mathemati- 
cians first  contemplated  infinite  games  involving  reals.  Following  another  antic- 
ipation of  the  theory  of  games  by  Hugo  Steinhaus  [25],  Banach  and  Stanislaw 
Mazur  in  1925  (according  to  Steinhaus  [65:464])  found  a non-determined  game 
of  the  sort  GX(A)  (using  AC;  cf.  27.2).  In  the  Scottish  book  of  problems  (see 
Mauldin  [81:  113 ff] ) Mazur  described  an  infinite  game  and  conjectured  its  con- 
nection to  Baire  category,  a conjecture  confirmed  by  Banach  in  1935.  Modifying 
the  problem  Ulam  essentially  asked:  For  which  A C a>  does  I (or  respectively 
II)  have  a winning  strategy  in  Go,  (A)?  It  was  to  take  set  theorists  half  a century 
to  provide  a fair  answer  to  a related  question:  For  which  A c wco  does  either  I 
or  II  have  a winning  strategy  in  G(U(A),  i.e.  is  Gm(A)  determined? 

By  the  mid-1940’s  von  Neumann  and  Oskar  Morgenstem  in  their  [44]  had 
codified  the  theory  of  games  and  its  approach  to  the  analysis  of  economic  behavior, 
stimulating  research  in  this  direction  for  decades  to  come.  It  was  against  this  back- 
drop that  a more  systematic  investigation  of  the  determinacy  of  infinite  games 
began. 

David  Gale  and  Frank  Stewart  initiated  the  study  of  the  games  Gx(A).  For 
non-empty  X and  .v  e <WX  let  O(s)  — {x  e WX  \ s c x).  Ac  WX  is  open  iff  it  is  a 
union  of  such  0(s)’s,  and  is  closed  iff  WX  — A is  open.  With  the  discrete  topology 
on  X,  these  are  just  the  concepts  of  open  and  closed  in  the  product  topology, 
consistent  with  the  case  X = a>.  The  following  result  is  simple  yet  fundamental 
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for  the  whole  subject;  the  argument  is  as  for  related  finite  games  (essentially  given 
by  Zermelo  and  von  Neumann  as  described  above)  as  membership  in  open  sets  is 
secured  at  a finite  stage. 

27.1  Proposition  (AC)(Gale-Stewart  [53]).  If  A C "X  is  either  open  or  closed, 
then  Gx(A ) is  determined. 

Proof.  For  any  B C "X  and  s € <0JX,  let 

B/s  = {x  e 0JX  | e B)  . 

If  I has  no  winning  strategy  in  Gx(B/s),  then  for  any  i e X there  is  a j e X 
such  that  I has  no  winning  strategy  in  Gx{B/s^{i,  j )):  Otherwise,  let  i e X be 
such  that  for  every  j e X,  I has  a winning  strategy  Oj  in  Gx(B/s^(i,  j)).  Then 
I would  have  a winning  strategy  in  Gx(B/s ) after  all:  Initially  make  the  move 
i , and  after  any  reply  j by  II,  play  according  to  oj. 

Suppose  now  that  A C WX  is  open,  and  assume  that  I has  no  winning 
strategy  in  GX{A).  Then  by  the  above  remark,  a strategy  r for  II  can  be  defined 
recursively  so  that  for  any  partial  play  s e Un£C02"X  according  to  r,  I has  no 
winning  strategy  in  Gx(A/j).  Now  if  jc  were  a play  according  to  r yet  x e A, 
then  by  openness  there  would  be  a 2 n e co  such  that  0(x\2n)  C A.  But  then,  any 
strategy  for  I in  Gx(A/x\2n)  would  be  a winning  one,  reaching  a contradiction. 
Hence,  r is  a winning  strategy  for  II  in  GX(A). 

The  argument  for  closed  A is  analogous,  with  the  roles  of  I and  II  inter- 
changed. H 

As  in  this  proof,  an  informal  description  of  a strategy  will  be  typically  given 
without  bothering  to  write  out  a formal  definition.  Although  the  main  case  is 
X — co,  proofs  of  determinacy  often  turn  on  the  determinacy  of  auxiliary  games 
corresponding  to  open  A C “X  for  some  X,  the  crux  of  the  matter  being  that 
membership  in  A is  secured  at  a finite  stage. 

The  full  exercise  of  AC  delimits  the  possibilities: 

27.2  Proposition  (AC)(Gale-Stewart  [53]).  There  is  a set  of  reals  which  is  not 
determined. 

Proof.  As  there  are  2^°  strategies  for  games  of  form  GW(A),  let  {cra  \ a < 2^°) 
enumerate  the  strategies  for  I and  (ra  \ a < 2K°)  those  for  II.  Recursively  choose 
aa,  ba  € 10 co  for  a < 2K°  as  follows:  Having  chosen  ap  and  bp  for  < a,  choose 
ba  so  that  ba  — oa  * y for  some  y yet  ba  ^ {cip  \ < a}.  This  is  possible  since 

|{o’a!  * y | y e “&>}|  = 2Ko,  the  function  taking  y e “sj  to  oa  * y being  injective. 
Similarly,  choose  aa  so  that  aa  = z * ra  for  some  z yet  aa  {bp  \ /3  < a}.  It  is 
simple  to  check  that  the  resulting  sets  A = { aa  \ a < 2K°]  and  B — {ba  \ a < 2K°] 
are  disjoint,  and  that  neither  I nor  II  has  a winning  strategy  for  G(U(A).  H 

Gale-Stewart  [53]  also  asked  the  basic  question  whether  every  Borel  set  of 
reals  determined.  By  27.1  open  sets  are  determined;  Philip  Wolfe  [55]  proved  that 
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sets  are  determined;  and  almost  a decade  later,  Morton  Davis  [64]  proved  that 
£3  sets  are  determined.  The  emerging  difficulties  were  to  be  explained,  and  further 
progress  to  be  stimulated,  by  metamathematical  arguments  (see  after  3 1 .2).  Prior 
developments  are  discussed  next  that  motivated  the  formulation  of  new  hypotheses 
in  set  theory. 

Games  and  Regularity  Properties 

Putting  the  regularity  properties  for  sets  of  reals  in  a new  light,  successively 
considered  are  three  games  closely  related  to  the  Baire  property,  the  perfect  set 
property,  and  Lebesgue  measurability. 

For  A C "a)  Mazur’s  Scottish  book  game  mentioned  earlier  is  essentially  the 
game  G**(A)  formulated  as  follows:  There  are  two  players  I and  II,  but  instead 
of  choosing  members  of  &>  they  choose  members  of  <coa>  — {0}: 

I : s0  s2 

II:  si  S3 

The  previous  terminology  adapts,  and  with  x the  concatenation  SfPsi^^^  ...  of 
the  resulting  play,  I wins  if  x e A,  and  otherwise  II  wins.  For  the  following, 
Mazur  initially  noted  the  forward  directions,  Banach  provided  the  converse,  and 
John  Oxtoby,  a generalization  to  arbitrary  topological  spaces. 

27.3  Proposition  (Mazur,  Banach  - Mauldin  [81:  113ffj;  Oxtoby  [57]).  For  A C 

mco, 

(a)  A is  meager  iff  II  has  a winning  strategy’  in  G**(A). 

(b)  0(s)  — A is  meager  for  some  s e <a>co  iff  I has  a winning  strategy  in 

G*W*(A). 

Proof,  (a)  Suppose  first  that  A is  meager,  so  that  A c [Jn€lo  Cn  where  each  Cn  is 
closed  and  nowhere  dense.  Then  a strategy  r for  II  can  be  defined  so  that  r((so>) 
is  some  t such  that  O(so"~f)  fl  Co  = 0,  and  generally  r((so,  si, . . . , s2n))  is  some 
t such  that  O(so~si'~' . . . ''s2lf't)  (T  C„  — 0.  Such  a r is  winning  for  II. 

Suppose  now  that  II  has  a winning  strategy  r . For  each  partial  play  according 
to  r of  form  p = (s0, . . . , s2n),  let  p.t  — sq"'  . . . '~'s2n  and  set 

Dp  — [x  e "ft>  | p*  c x ->•  3t  e <coa>  - {0}(p*~f~r(p"'(f})  c x)}  . 

Then  each  Dp  is  open  (noting  in  part  that  0(pt)  is  clopen)  and  dense  (for  if 
a e <0Ja>,  either  u 2 P*  so  that  O(u)  c Dp,  or  else  there  is  a t e <a>u>  — [0] 
such  that  p^t  = u and  so  any  x e wa>  with  x 2 P*^t^T(p~  (t))  satisfies 
x e 0(u)  fi  Dp).  Moreover,  for  any  x e R pDp  we  can  recursively  define  a 
play  {.Sj  | i e a>)  according  to  r such  that  x = and  so  x ^ A. 

Consequently,  A C (J pf°u>  — Dp),  a countable  union  of  nowhere  dense  sets. 

(b)  Suppose  first  that  0(s)  — A is  meager  for  some  s e <wco,  which  can  be 
taken  to  be  non-empty.  Then  I has  a winning  strategy  beginning  with  initial  move 
s and  then  playing  to  avoid  GO  — A as  in  the  forward  direction  of  (a). 


374 


Chapter  6.  Determinacy 


For  the  converse,  if  I has  a winning  strategy  er,  say  with  a(0)  = s,  then  it  is 
simple  to  see  that  II  has  a winning  strategy  in  G™(0(s)  — A)  derived  from  a, 
and  so  0(s ) — A is  meager  by  (a).  H 

27.4  Corollary.  For  A C :"o>,  setting 

= |J{(9(j)  | 5 e <10cl>  A C> (a)  — A is  meager}  , 

if  G**(A  — Oa  ) is  determined,  then  A has  the  Baire  property. 

Proof.  If  I had  a winning  strategy  in  G**(A  — Oa),  then  for  some  t e <0>co, 
O(t)  — (A  — Oa)  would  be  meager.  But  then,  0(t)  — A would  also  be  meager  and 
so  that  0(t)  C Oa  and  0(t)  — (A  — Oa)  = 0{t),  which  is  a contradiction  (0.11). 
Hence,  II  has  a winning  strategy  in  G**(A  — Oa),  and  so  A — Oa  is  meager.  But 
O a — A is  also  meager  by  definition  of  0A , and  so  A has  the  Baire  property.  H 

The  game  for  the  perfect  set  property  is  based  on  X — 2.  For  A C w2,  G^A) 
is  the  game  formulated  as  follows:  I chooses  members  of  <ro2  and  II,  members 
of  2: 


I : s0  s2 

II:  k\  k2 

The  previous  terminology  adapts,  and  with  x being  the  concatenation 
sq^ {ki)^s2'~'{ki)^' . . . of  the  resulting  play,  I wins  if  r e A,  and  otherwise  II 
wins.  With  “2  topologized  with  basic  open  sets  O(s)  = [x  e ®2  | 5 c x}  for 
s e <a>2,  a perfect  set  is  as  before:  non-empty,  closed  and  containing  no  isolated 
points. 

27.5  Proposition  (Davis  [64]).  For  A C a'2, 

(a)  A is  countable  iff  1 1 has  a winning  strategy > in  Gj(A). 

(b)  A has  a perfect  subset  iff  I has  a winning  strategy  in  Gj(A). 

Proof,  (a)  Suppose  first  that  there  is  an  enumeration  (a,-  \ i e co)  of  A.  Then  II  has 
a simple  winning  strategy:  He  plays  his  ith  move  to  ensure  that  the  concatenation 
of  the  resulting  play  differs  from  a,-. 

Suppose  now  that  II  has  a winning  strategy  r.  Proceeding  as  in  the  proof 
of  27.3(a),  for  a partial  play  according  to  r of  form  p — (so,  k\, s2n,  k2n+i) 
let  = s0^(ki)^  . . . ''s2n~(k2n+ 1>  and  set 

Dp  = [x  e ®2  | p*  c x -a  3t  e <<u2(/?*'~f"r(p''(t})  c .*)}  . 

Then  as  before,  A c |J^ ("2  — Dp). 

Now  for  each  p,  x e ®2  — Dp  exactly  when  p * c and 
Vf  e <w2(p*'~'t~'T(p''(t))  £ x).  But  then,  w2  — Dp  has  a unique  member  xp: 
With  |p* | = m,  xp\m  = p*;  necessarily  xp(m)  — 1 — r(p~'(0});  and  recursively 


xp(e)  = 1 - T(p~(xp(m),  ...,  xp(e  - 1)» 
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for  e > m.  (This  argument  is  possible  because  of  the  switch  from  “si  to  0>2.) 
Hence,  A is  countable. 

(b)  Suppose  first  that  A has  a perfect  subset  P.  Setting 
T — {x\n  \ x e P A n e a>]  , 

a strategy  for  I can  be  described  as  follows:  Let  the  initial  move  be  an  s e T 
such  that  both  s'-'(O)  and  s~(l)  are  in  T ; such  an  s exists  as  P has  no  isolated 
points.  Generally,  in  response  to  a partial  play  p with  concatenation  /?*  e T,  let 
the  move  be  an  s such  that  both  p*^s^( 0)  and  p^s^{  1)  are  in  T;  again  such 
an  ,v  exists  as  P has  no  isolated  points.  This  is  a winning  strategy  for  / as  P is 
closed. 

For  the  converse,  note  that  if  a is  a winning  strategy  for  I , {a  * y \ y e "2} 
is  a perfect  subset  of  ,4.  H 

It  is  simple  to  make  the  transfer  from  c02  to  '"co.  This  can  be  done  on  general 
grounds,  or  by  applying  a specific  map: 

For  n,k  e co  define  bk:  n + 1 — »■  2 as  follows:  If  k is  even,  set  bkn(i)  — 1 for 
i < n and  bkn(n)  = 0;  if  k is  odd,  set  bk(i)  — 0 for  i < n and  bk(n)  — 1.  Then 
define  'P : ‘“co  — > "2  by: 

i P(x)  = b°^0)  b~( 2)  ... 

It  is  straightforward  to  check  that  the  range  consists  of  all  but  the  countably  many 
members  of  ®2  that  are  eventually  constant,  and  that  P is  actually  a homeomor- 
phism  onto  its  range  considered  as  a subspace  of  M2.  Also,  for  B c “2,  B is 
perfect  iff  is  perfect,  and  B is  countable  iff  'P~X{B)  is  countable.  This 

leads  to  the  following: 

27.6  Corollary.  For  A C 10 co,  G\('P"'A)  is  determined  iff  A has  the  perfect  set 
property.  H 

For  Lebesgue  measurability,  Mycielski-Swierczkowski  [64]  provided  the  first 
game  argument.  Harrington  then  found  a simpler  approach  with  his  covering  game. 
This  is  best  cast  as  a game  for  a>2,  but  the  wco  context  is  maintained  through 
contrivance:  For  A <zwco  and  f el  with  e > 0,  the  covering  game  G(A,  e ) is 
played  with  integer  moves: 

I : x (0)  x(2) 

II:  jr(l)  x(3) 

However,  the  payoff  scheme  is  somewhat  involved.  Let  P : 01  co  — > a>2  be  as  above, 
and  on  its  range,  let  be  its  inverse  function.  I must  always  play  0’s  or  l’s  so 
that  the  resulting  x/  is  not  eventually  constant.  Each  of  7/’s  moves  is  regarded 
as  coding  a finite  union  of  basic  open  sets;  for  definiteness,  in  terms  of  the  fixed 
recursive  enumeration  {s,  | i e co)  of  <a)co  set: 
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Nj  — 0( sSj(0))  U . . . U 0(sSj(|S||_i))  . 

II’ s move  x(2 n + 1)  must  then  satisfy 

mL(Nx(2n+i))  < e/22(n+1)  . 

Now  if  either  I or  II  fails  to  meet  these  conditions,  the  first  to  do  so  loses. 
Assuming  that  this  does  not  happen,  the  idea  is  that  II  tries  to  cover  A with 
Unea>Nx(2n+i)-  Accordingly,  I wins  if  |7/_1(*/)  eA-  \Jn€(0  Nx(2n+i),  and  other- 
wise II  wins. 

27.7  Lemma.  In  G(A,  e): 

(a)  If  I has  a winning  strategy’,  there  is  a Lebesgue  measurable  B C A such 
that  mi(B ) > 0. 

(b)  If  II  has  a winning  strategy,  there  is  an  open  set  O 3 A such  that 
mL(0 ) < e. 

Proof,  (a)  Suppose  that  a is  a winning  strategy  for  I.  Then  the  set  of  7’s  outputs 
according  to  er, 

B = {^'((ct  * y)/)  | y G u’a>}  , 

is  seen  to  be  a L]  set  (in  a parameter  coding  er)  and  so  is  Lebesgue  measurable. 
The  rules  of  the  game  imply  that  mL(B ) > 0:  Otherwise,  it  is  simple  (using  0.8(d) 
and  0.9(a))  to  find  (x(2n  + 1)  | new)  so  that  II  wins  a play  even  according  to 
a . 

(b)  Suppose  that  r is  a winning  strategy  for  II.  For  any  5 G <w2  — {0},  say 
with  | ,y|  = n+ 1,  let  d(s)  be  I I’s  response  (the  (2n+ 1 )st  move  of  the  game)  of  the 
play  according  to  r where  /’  s moves  are  successively  s(0), ...,«(»).  Then  as  r is 
winning  for  II,  for  any  igA  there  must  be  an  n e co  such  that  jc  G Nd(<P(x)\(n+\))- 
Consequently,  if 

O = \J{Nd(S)  l*e<82- {0}}  , 

then  A c O.  Moreover,  according  to  the  rules  of  the  game  mL(Nd(S ))  < e/22(n+1) 
where  |s|  = n + 1,  and  there  are  2"+1  such  s’s.  Hence, 

mL{0)  < £„e^/22(',+1))  - 2("+1)  = L„e^/2(”+1)  = e . H 

For  A C aco,  B c wco  a minimal  cover  for  A iff  A c B,  B is  Lebesgue 
measurable,  and  if  Z c B — A is  Lebesgue  measurable,  then  mL(Z ) = 0.  Any 
A C ma)  has  a minimal  cover:  take  any  Lebesgue  measurable  B 3 A with  mL(B) 
minimal.  In  fact,  B can  be  taken  to  be  Gs  by  0.9(b).  Noting  that  27.7  can  be 
established  in  our  ambient  theory  ZF,  the  argument  for  14.4  showing  in  ZF  that 
every  L[  set  is  Lebesgue  measurable,  there  is  the  following  corollary: 

27.8  Corollary.  Suppose  that  A C B C is  a minimal  cover  for  A,  and 
for  any  e G R.  with  e > 0,  G(B  — A,  e)  is  determined.  Then  A is  Lebesgue 
measurable. 
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Proof.  By  choice  of  B , II  must  have  a winning  strategy  in  each  of  the  games 
G(B  — A,  e).  Hence,  B — A c G where  G is  a n?  set  with  mL(G)  = 0.  H 

Determinacy  Hypotheses 

In  Poland,  with  renewed  interest  in  infinite  games  (Mycielski-Zieba  [55],  Myciel- 
ski-Swierczkowski-Zieba  [56])  Mycielski  and  Steinhaus  in  their  [62]  proposed  the 
following  axiom,  now  known  as  the  Axiom  of  Determinacy. 

(AD)  Every  set  of  reals  is  determined  . 

(Steinhaus  [65:465])  recalled  how  the  axiom  came  to  be  formulated.  He  there 
used  the  term  “determinacy”,  but  the  early  papers  on  the  subject  used  the  term 
“determinateness”.  By  the  1970’s  “determinacy”  came  to  prevail.)  The  basic 
theory  of  AD  was  soon  set  out  in  Mycielski  [64, 66]  and  Mycielski-Swierczkowski 
[64]. 

Mycielski  [66]  considered  a further  hypothesis: 

(ADr)  G»W(A)  is  determined  for  every  A C w(woj)  . 

ADr  readily  implies  AD.  AD  was  to  be  the  main  preoccupation,  but  ADr  was  to 
figure  as  a suitably  strong  hypothesis  in  later  developments. 

Concerning  the  regularity  properties,  the  games  above  can  all  be  recast  as 
games  of  form  GW(B).  For  G**(A)  for  example,  in  terms  of  the  fixed  recursive 
enumeration  (s,  | i e co)  of  <u>io  with  so  = 0,  define  B c w(y>  — {0})  by 

x e B iff  st(0)''sx(i)'*'sx(2)'' ...  e A . 

Then  I has  a winning  strategy  in  G**(A)  exactly  when  I has  a winning  strategy 
in  Gco(B),  and  similarly  for  II.  This  leads  to  the  following  summarizing  result: 

27.9  Theorem  (Mycielski-Swierczkowski  [64];  Mazur,  Banach;  Davis  [64]).  As- 
sume AD.  Then  every  set  of  reals  is  Lebesgue  measurable,  has  the  Baire  property, 
and  has  the  perfect  set  property.  H 

This  was  the  main  incentive  behind  the  formulation  of  AD.  AD  thus  contra- 
dicts AC  by  actually  banishing  those  “pathological”  non-measurable  sets  and  the 
like  given  by  unfettered  applications  of  AC.  Nonetheless,  accepting  AC  Mycielski 
and  Steinhaus  wrote  [62: 2]  of  their  axiom  that  it  could  be  considered  as  leading 
to  a nice  submodel: 

It  is  not  the  purpose  of  this  paper  to  depreciate  the  classical  mathematics  with  its 
fundamental  ‘absolute’  intuitions  on  the  universum  of  sets  (to  which  belongs  the  axiom 
of  choice),  but  only  to  propose  another  theory  which  seems  very  interesting  although  its 
consistency  is  problematic.  Our  axiom  can  be  considered  as  a restriction  of  the  classical 
notion  of  a set  leading  to  a smaller  universum,  say  of  determined  sets,  which  reflect 
some  physical  intuitions  which  are  not  fulfilled  by  the  classical  sets  (e.g.  paradoxical 
decompositions  of  the  sphere  are  eliminated  by  [AD]).  Our  axiom  could  be  considered 
as  an  axiom  added  to  the  classical  set  theory  claiming  the  existence  of  a class  of  sets 
satisfying  [AD]  and  the  classical  axioms  without  the  axiom  of  choice. 
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Mycielski  [64: 205]  hoped  for  an  inner  model:  “a  subclass  of  the  class  of  all  sets 
with  the  same  membership  relation.  It  would  be  still  more  pleasant  if  such  a 
submodel  contains  all  the  real  numbers.”  As  Solovay  and  Gaisi  Takeuti  pointed 
out,  the  natural  possibility  is  L( R),  the  smallest  inner  model  containing  every 
real.  If  AD  holds,  then  so  does  ADi(R):  Every  strategy  is  coded  by  a real  and  so 
belongs  to  L(R),  and  the  property  of  being  a winning  strategy  for  a game  Gm(A) 
with  A e L(R)  is  absolute  for  L(R). 

In  the  late  1960’s  Solovay  made  the  prescient  conjecture  that  ADL(R)  holds 
relative  to  some  large  cardinal  hypothesis.  (In  print  Solovay  [69:  60]  conjectured 
that  if  there  is  a supercompact  cardinal,  then  in  L(R)  every  set  of  reals  has  the 
perfect  set  property.)  ZF  + AD  was  never  widely  entertained  as  a serious  alterna- 
tive to  ZFC,  and  increasingly  from  the  early  1970’s  onward  consequences  of  ZF 
+ AD  were  regarded  as  what  holds  in  L(R)  assuming  ADi(R).  This  compromise, 
if  you  will,  reaffirmed  ZFC  as  the  ambient  theory  while  at  the  same  time  vastly 
increasing  the  sets  of  reals  possessing  the  regularity  properties  through  a maximal 
principle. 

AD  itself  provides  a countable  choice  principle,  AQX “&>)  of  §§12, 13: 

27.10  Proposition  (Swierczkowski,  Mycielski  [64],  Scott).  Assume  AD.  Then 
AC  „(“&>),  i.e.  every  countable  set  consisting  of  non-empty  sets  of  reals  has  a choice 
function.  Consequently,  u>\  is  regular. 

Proof.  Suppose  that  [Z„  | n e &>}  c V(wcl>)  — [0].  Define  A C wco  by:  j e A iff 
Xu  £ ZV(0).  Clearly  I cannot  have  a winning  strategy  in  Gm (A).  But  then,  with 
r a winning  strategy  for  II,  /:&>—>•  "cu  defined  by  f(n)  — ({ n , 0,  0, . . .)  * x)u 
is  a choice  function  as  desired.  H 

As  noted  in  §§  12, 13  AC^Ifco)  suffices  for  the  results  in  those  sections,  and  so 
they  can  be  brought  into  play  when  establishing  results  in  ZF  + AD.  Nonetheless, 
it  soon  became  de  rigueur  to  assume  the  stronger  DC  in  much  of  the  investigation 
of  AD.  DC  is  needed  for  those  consequences  of  AD  in  descriptive  set  theory 
depending  on  the  characterization  of  well-foundedness  in  terms  of  the  lack  of 
infinite  descending  chains.  Moreover,  DC  relativizes  to  L(R)  (11.13),  the  natural 
inner  model  for  AD. 

As  described  in  § 1 1 Cantor  initiated  a programmatic  approach  to  the  Con- 
tinuum Hypothesis  via  the  perfect  set  property.  Since  AD  implies  the  perfect  set 
property  for  every  set  of  reals,  AD  realizes  that  program  in  the  sense  that  for  no  set 
X does  Ko  < |A|  < 2K°  (where  < between  “cardinals”  in  the  ZF  setting  is  defined 
via  injective  embeddability).  However,  it  should  be  stressed  that  Cantor  wanted 
to  establish  the  stronger  2K°  = Ki,  i.e.  that  there  is  some  well-ordering  of  the 
continuum  in  ordertype  w\.  Observations  that  were  applied  in  §11,  incorporating 
a deduction  by  Specker  [57],  were  first  made  for  AD: 
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27.11  Proposition  (Mycielski  [64]). 

(a)  Assume  AD.  Then  u>\  2^°,  i.e.  there  is  no  uncountable  well-orderable 

set  of  reals. 

(b)  Con(ZF  + AD)  implies  Con(ZFC  + 3k  (k  is  inaccessible)). 

Proof,  (a)  follows  from  11.4(b),  and  (b),  from  (a),  the  regularity  of  &>i  (27.10), 
and  (c)  — >■  (a)  of  11.6.  H 

(b)  was  the  first  connection  made  with  large  cardinals.  It  follows  from  (a) 
that  an  extension  of  AD  in  a different  direction  from  ADr  is  already  inconsistent 
with  ZF: 

27.12  Exercise  (Mycielski  [64]).  There  is  an  A C ®coi  such  that  Gm  (A)  is  not 
determined. 

Hint.  Let  A C wa>\  be  defined  by:  x e A iff  x(0)  > co  and  Xu  is  not  an 
injective  enumeration  of  x(0)  = [f  | £ < v(0)}.  If  GWl(A)  is  determined,  it 
must  be  II  who  has  a winning  strategy,  say  r.  For  any  z e 01oj\  with  z(0)  > co, 
{( m,n ) | z * r(2 m + 1 ) < z * r (2 n + 1)}  is  then  a well-ordering  of  ordertype  z(0). 
It  follows  that  there  is  an  injection:  co\  ->  01  co  definable  from  r,  so  that  AD  fails 
by  27.11(a).  H 

This  argument  does  not  provide  an  A C 0>(x>\  such  that  GMl(A)  is  not  de- 
termined, and  curiously  it  may  not  be  possible  to  do  so  (see  Harrington-Kechris 
[81: 133-134]). 

With  full  AD  confronting  AC  and  raising  problematic  issues,  a natural  ap- 
proach in  the  ZFC  context  is  simply  to  consider  determinacy  as  yet  another  regu- 
larity property  of  sets  of  reals,  one  that  subsumes  the  others  by  local  versions  of 
27.9,  and  to  investigate  its  possible  extent.  Consider 

(Det(A))  Every  set  of  reals  in  A is  determined  . 

Thus,  AD  is  DetfPC "&>)),  and  ADi(R)  is  Det(L(R))  by  absoluteness.  Defill',) 
is  known  as  II*  Determinacy,  and  so  forth.  Focal  became  the  hypothesis  of 
Projective  Determinacy. 

(PD)  Det(LUIT‘)  . 

The  consequences  of  PD  will  be  analyzed  at  length  in  §29,  and  Borel  Determinacy 
discussed  after  31.2.  Ilj  Determinacy  was  soon  seen  to  transcend  ZF,  by  the  first 
extension  of  27.1 1(b): 

27.13  Exercise  (AC^f0®)).  Det(nj)  implies  Wa  e aco(co\  is  inaccessible  in  L[fl]). 

Hint.  By  11.5  it  suffices  to  show  that  co\^  < co\  for  every  a e l°co.  But  if 
c o = co\,  then  by  13.12  relativized  there  would  a be  n\{a)  set  A without 
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the  perfect  set  property.  Now  apply  27.6,  after  checking  that  G*(<^“A)  can  be 
construed  as  GC0{B)  for  some  Tl\(a)  set  B of  reals.  H 

The  concept  of  determinacy  having  emerged  and  its  basic  consequences  and 
connections  worked  out,  two  seminal  results  appeared  in  1967  as  was  mentioned 
at  the  outset.  With  the  result  of  Blackwell  [67]  to  serve  as  the  starting  point  of 
§29,  the  development  from  the  other  result  is  explored  in  the  upcoming  section. 
But  before  concluding  this  one,  some  related  results  of  later  vintage  are  tucked  in 
that  will  serve  subsequent  commentary  as  counterpoint. 

A local  version  of  27.9  is:  If  Det(II* ),  then  every  FT,'  set  of  reals  is  Lebesgue 
measurable,  has  the  Baire  property,  and  has  the  perfect  set  property.  As  antici- 
pated, this  merely  requires  checking  for  n > 0 that  for  any  II,'  set  A of  reals,  the 
games  corresponding  to  the  regularity  properties  for  A can  each  be  recast  in  form 
Go,(B)  for  some  IT,'  set  B of  reals. 

It  was  noticed  in  the  early  1970's  that  a simple  way  of  modifying  games  due 
to  Solovay  called  “unfolding”  leads  to  stronger  conclusions: 

27.14  Exercise  (AC^PaOKKechris,  Martin).  Assume  Defill*).  Then  every  E,'l+ , 
set  of  reals  is  Lebesgue  measurable,  has  the  Baire  property,  and  has  the  perfect  set 
property. 

Hint.  Suppose  that  A C.‘°a>  is  E*+1.  To  establish  the  Baire  property  for  A,  apply 
Solovay’ s unfolding  idea  as  follows: 

Let  0A  be  the  open  set  as  in  the  statement  of  27.4,  and  set  IS  — A — 0A,  a 
L,'l+ 1 set.  Let  C c 2(fcd)  be  a IT2  set  such  that  B — pC.  The  unfolded  version 
of  the  game  G**(B)  is  played  as  follows: 

I ■■  (y(0),s0)  OKI),  s2) 

II:  si  S3 

As  for  G**{B),  I and  II  choose  members  s,  of  <<0u>  — [0],  but  in  addition  1 
chooses  y(i)  e a>.  With  x the  concatenation  so'^s1^'s2^  . . . and  y e wa>  the  result 
of  /’  s integer  choices,  I wins  if  (x,  y)  e C,  and  otherwise  II  wins.  As  C is  IT', 
this  unfolded  game  can  be  cast  as  one  of  form  Gm{D)  for  some  TT('  set  D of  reals, 
and  so  the  game  is  determined  by  hypothesis. 

Suppose  first  that  II  has  a winning  strategy  r.  Then  argue  much  as  for 
27.3(a)  that  B must  be  meager:  Disregarding  /’ s integer  moves  define  the  open 
dense  sets  Dp  as  before,  and  then  show  that  if  x e p Dp,  then  x $ B.  (If  to 
the  contrary  x e B,  then  there  would  be  a y e aco  such  that  {x,  y)  e C.  But 
then,  a play  according  to  r can  be  recursively  defined  with  y(i)’s  as  /’ s integer 
moves  and  x as  the  concatenation  of  the  finite  sequences,  contradicting  r being  a 
winning  strategy.) 

Suppose  next  that  I has  a winning  strategy  a . Then  I has  a winning  strategy 
in  the  original  game  G**(B):  He  simply  keeps  the  y(i)’ s to  himself  and  plays 
only  the  s(2i)’s.  Hence,  by  27.3(b),  0(s)  — B is  meager  for  some  s e <<0cl>. 

Now  conclude  by  the  argument  for  27.4  that  A has  the  Baire  property. 
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For  the  perfect  set  property,  argue  similarly  with  an  unfolded  version  of 
the  game  G\i}P^A)  where  I makes  extra  choices  y{i)  (cf.  27.5  and  27.6).  The 
resulting  “witness”  y can  be  taken  to  be  in  “2  or  just  in  coa>,  depending  on  whether 
the  homeomorphism  — > "2  is  to  be  applied  first  or  not. 

Finally,  establish  Lebesgue  measurability  for  the  complement  A — mco  — A 
as  follows:  As  described  before  27.8  let  B be  a Gs  minimal  cover  for  A,  and  for 
rational  e e R with  e > 0 argue  with  unfolded  versions  of  G(B  — A,  e)  where 
I makes  extra  choices  y(i)  (cf.  27.7  and  27.8).  Dealing  with  the  complement  A 
instead  of  A , these  games  can  be  recast  in  form  G,J  D)  for  some  IT,1,  set  D of 
reals.  Again,  the  resulting  “witness”  y can  be  taken  to  be  in  ®2  or  just  in  10 co.  H 

As  a nice  complement,  note  that  the  n — 0 case  provides  a new  proof,  based 
on  the  determinacy  of  closed  games,  of  the  classical  result  that  the  analytic  sets 
have  the  regularity  properties. 

The  next  observations  concern  ADr,  that  mightier  hypothesis  lurking  large 
and  inchoate  behind  AD.  In  analogy  to  27.10,  ADR  implies  that  every  M&>-indexed 
family  of  nonempty  sets  of  reals  has  a choice  function: 

27.15  Exercise.  Assume  ADr.  Then  every  subset  of  2 (fed)  can  be  uniformized. 

Hint.  Suppose  that  A c 2(“&>)  and  consider  the  two-move  game  where  I chooses 
a real  x and  II  wins  iff  he  responds  with  a real  y such  that  (x,  y)  e A.  ADr 
implies  that  II  must  have  a winning  strategy,  and  such  a strategy  provides  a 
uniformization.  H 

In  contrast,  the  following  delimitation  was  seen  through  a diagonalization 
argument  showing  that  unlike  AD,  ADr  does  not  relativize  to  L(R)  as  a natural 
inner  model. 

27.16  Proposition  (Solovay  [78a:  176]).  Assume  V = L(R)  and  there  is  no  well- 
ordering  of  the  reals.  Then  there  is  a subset  of  2(a>co)  that  cannot  be  uniformized. 

Proof.  For  any  x e wco,  let  OD(x)  denote  the  class  of  sets  definable  using 
parameters  from  On  U {x}.  As  for  the  well-known  class  OD  of  ordinal  definable 
sets,  OD(x)  has  a definable  well-ordering,  and  the  2-ary  ry  e OD(x)1  is  definable 
via  the  Reflection  Principle  for  ZF.  Let  Ac2  (fod)  be  such  that 

(x,  y)  e A **  y OD(x)  . 

Since  by  hypothesis  there  is  no  well-ordering  of  the  reals,  Vx3y((x,  y)  e A). 

Assume  now  to  the  contrary  that  there  is  a B C A uniformizing  A.  Since 
by  hypothesis  V = L(M.)  and  there  is  a definable  surjection:  On  x ®w  ->  L(R) 
(cf.  the  proof  of  11.13),  B e OD(xo)  for  some  xo  e ® a>.  But  then,  that  unique  yo 
such  that  B(x o,  yo)  would  be  in  OD(xq).  Contradiction!  H 
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For  any  S c wco,  by  considering  an  analogous  concept  OD^S,  x})  for  x ewco 
and  noting  that  {y  e m(o  \ y e OD([S,  x})}  is  still  well-orderable,  27.16  can  be 
extended  to  L(R  U {S'}).  This  leads  to  the  following: 

27.17  Proposition  (Solovay  [78a:  177]).  Assume  AD®.  Then  for  any  S c 01  oj, 

f/L(RU{5]).  ^ H 

Solovay  [78a]  went  on  to  formulate  sharps  for  sets  of  reals  and  show  that 
ADr  implies  their  existence  for  every  such  set.  In  contrast  to  the  situation  for 
AD,  for  quite  some  time  it  was  unclear  whether  there  is  a natural  inner  model  of 
ADr  containing  every  real.  This  issue  was  resolved  in  the  1980’s  as  broached  in 
the  last  section  (cf.  32.19). 

The  final  remarks  concern  the  Ramsey  property  discussed  at  the  end  of  § 1 1 ; 
for  present  purposes  the  V(co)  context  there  is  shifted  to  01  o>.  Being  Ramsey  is 
also  regarded  as  a regularity  property  for  sets  of  reals.  Like  the  others,  every  set 
of  reals  is  Ramsey  in  Solovay’s  model  (11.15),  and  there  was  progress  through 
the  first  levels  of  the  projective  hierarchy:  Galvin-Prikry  [73]  showed  that  every 
Borel  set  of  reals  is  Ramsey,  and  Silver  [70a]  that  every  analytic  set  is  Ramsey, 
and  if  there  is  measurable  cardinal,  that  every  set  is  Ramsey.  Erik  Ellentuck 
[74]  devised  proofs  of  these  results  using  a topology  based  on  the  Mathias  forcing 
conditions  and  observing  that  a set  of  reals  is  Ramsey  exactly  when  it  ( through  an 
identification)  has  the  Baire  property  in  that  topology.  Making  the  first  connection 
with  determinacy,  Prikry  [76]  then  appealed  to  the  determinacy  of  the  correspond- 
ing **-game  with  real  moves  (cf.  27.3)  to  show  that  ADr  implies  that  every  set 
of  reals  is  Ramsey,  (llias  Kastanas  [83]  established  related  results.)  Consequences 
of  later  developments  are  that  PD  implies  that  every  projective  set  of  reals  is  Ram- 
sey (Harrington-Kechris  [81]),  and  that  AD  and  V — L(R)  implies  that  every  set 
of  reals  is  Ramsey  (Martin-Steel  [83]).  However,  with  the  corresponding  games 
naturally  based  on  real  moves,  the  following  question  remains  open: 

27.18  Question.  Does  AD  imply  that  every  set  of  reals  is  Ramsey? 
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Solovay  established  in  1967  that  AD  implies  that  w\  is  measurable,  injecting 
emerging  large  cardinal  intuitions,  techniques,  and  results  into  a novel  setting 
without  AC.  Unexpected  was  how  strong  an  effect  AD  has  on  the  transfinite,  and 
intriguing  were  the  possibilities  for  further  large  cardinal  consequences.  Others 
soon  joined  in,  with  Martin  making  particularly  incisive  contributions,  in  a first 
wave  of  what  was  to  become  a broad  combinatorial  investigation.  This  section 
pursues  this  development,  emphasizing  the  richness  of  the  game  paradigm  with  a 
variety  of  games.  As  for  §27, 

ZF  serves  as  the  ambient  theory  for  this  section  . 

Although  the  preoccupation  is  with  direct  consequences  of  AD,  it  is  noted  again 
that  they  were  increasingly  regarded  in  ZFC  as  what  holds  in  L(R)  assuming 
ADL(R). 

The  following  was  an  early  observation  about  ultrafilters. 


28.1  Proposition.  Assume  AD.  Then  there  are  no  (non-principal)  ultrafilters  over 
cd.  Hence,  ever y ultrafilter  is  w\-complete. 

Proof.  Assume  to  the  contrary  that  there  is  an  ultrafilter  U over  cd.  Consider  a 
game  where  the  players  choose  members  of  [&>]<<u: 

I : s0  s2 

II:  si  S3 

If  a move  sn  is  not  disjoint  from  U,<„  si>  then  the  player  to  make  the  first  such 
move  loses.  Assuming  that  this  does  not  happen,  I wins  if  (Jiea)  S2;  € U,  and 
otherwise  II  wins.  A contradiction  is  derived  by  showing  that  a winning  strategy 
for  either  player  can  be  converted  to  one  for  the  other: 

Suppose  first  that  a is  a winning  strategy  for  I.  Define  a strategy  rCT  for  II 
by: 

rCT((s  o»  = er(0),  and  for  0 < i e co,  ra((s0, ...,  s2i))  = er((si, . . . , s2(»  - s0  , 


i.e.  ignore  the  first  move  s 0 and  play  according  to  a,  subtracting  s 0 to  maintain 
disjointness  of  moves.  For  any  play  (so,  si,  s2,  • • ■)  according  to  rCT  consisting  of 
disjoint  moves,  so  U .s’2,+1  e U as  a is  winning  for  I,  and  so  [_]i€los2i+\  e U 
as  U is  presumed  non-principal.  Flence,  t„-  is  winning  for  II  since  U is  ultra  - a 
contradiction  in  this  case. 

Suppose  next  that  r is  a winning  strategy  for  II.  First  modify  r to  a strategy 
r for  II  defined  by: 

( t((s0,  . . . , s2,»  u {/}  if  i <£  U j <2i  Sj  , and 
r((s0,  ...,s2,»  = j 

[ r((so,  ■ . . , s2i))  otherwise  . 


Then  r is  also  winning  for  II,  and  moreover  for  any  play  (so,  si,  S2  . . .)  according 
to  r consisting  of  disjoint  moves,  (J;e®  s2i+i  e U as  U,ea)  si  — ® and  U is  ultra. 


384 


Chapter  6.  Determinacy 


Now  define  a strategy  crT  for  I by 

oy«s0,  • • • , s2i~\))  = r«0,  s0.  ■ • • . S2i— 1»  , 

i.e.  assume  there  is  a first  move  0 and  play  according  to  r . Then  as  in  the  previous 
argument,  ay  is  winning  for  I - a contradiction  in  this  case  also. 

For  the  second  assertion,  if  W were  an  ultrafilter  over  a set  S which  is  not 
u>i  -complete,  there  would  be  a function  f:  S -»  co  such  that 

MW)  = {XCa>  | f~\X)e  W] 

is  a (non-principal)  ultrafilter  over  co.  H 

This  result  can  also  be  seen,  albeit  less  directly,  via  27.9  and  the  result 
of  Sierpinski  [38]  that  an  ultrafilter  over  co  construed  as  a set  of  reals  is  not 
Lebesgue  measurable.  Of  course,  co\  -completeness  is  only  of  consequence  if  there 
are  ultrafilters  at  all  under  AD,  and  this  was  established  by  Solovay: 

28.2  Theorem  (Solovay).  Assume  AD.  Then  a>\  is  measurable. 

Measurable  cardinals  are  regular  in  ZF  because  of  the  completeness  of  their 
witnessing  ultrafilters.  It  is  AC  that  renders  them  weakly  inaccessible  (and  much 
more)  through  the  enumerability  of  power  sets.  However,  the  absence  of  AC  does 
not  affect  consistency  strength.  rZF  + co\  is  measurable1  is  equiconsistent  with 
rZFC  + 3 k{k  is  measurable)1:  Jech  [68],  Levy,  and  Takeuti  [70]  each  observed 
that  in  ZFC  if  a measurable  cardinal  is  Levy  collapsed  to  co\,  then  in  an  inner 
model  of  the  resulting  extension  co\  is  measurable.  Conversely,  if  in  ZF,  U is  a 
/c-complete  ultrafilter  over  some  k > co,  then  L[U]  \=  ZFC  + k is  measurable. 
Moreover,  for  any  a e coco,  L[U,  a]  |=  ZFC  + k is  measurable,  where  L[U,a], 
defined  using  predicate  symbols  for  U and  a,  is  the  smallest  inner  model  such  that 
U fi  M e M and  a e M.  Thus,  although  AC  is  not  actually  needed  in  §9,  a ZFC 
context  is  in  any  case  available  to  develop  the  theory  of  a# , and  its  absoluteness 
properties  (cf.  14.12(a))  ensure  its  substance. 

The  following  was  an  early  extension  of  27.11(b)  about  the  consistency 
strength  of  AD. 

28.3  Corollary. 

(a)  Con(ZF  + AD)  implies  Con(ZFC  + 3 k{k  is  measurable)). 

(b)  Assume  AD.  Then  Wa  e ‘°co(a#  exists).  H 

The  measurability  of  o>i  also  has  direct  consequences  in  ZF,  e.g.  there  are  no 
cui-Aronszajn  trees  (cf.  7.8,  7.10). 

Starting  with  his  original  argument  two  proofs  of  Solovay’ s theorem  are 
given,  both  of  which  became  paradigmatic  for  further  work. 
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First  Proof  of  28.2.  Solovay  devised  a game  where  the  players  are  required  to 
play  codes  for  well-orderings  and  used  a bounding  argument  with  £[  sets  to  show 
that  under  AD,  the  closed  unbounded  filter  C01)  over  u> \ is  an  ultrafilter.  By  28.1 
the  measurability  of  to i follows. 

To  motivate  Solovay’ s argument  consider  for  Y C an  a game  where  the 
players  choose  countable  ordinals: 

I '■  % o 

II  : £3 

I wins  if  (sup{§,-  | i e to})  e Y,  and  otherwise  II  wins.  It  is  argued  that  if  I has 
a winning  strategy  a,  then  Y e CWl  (and  the  argument  will  show  that  if  II  has  a 
winning  strategy,  then  oo\  — Y e CMI): 

As  u>\  is  regular  (27.10),  for  each  rj  < an  let  w(?;)  < to\  be  least  so  that 
a\<u>rj  C w(r]),  i.e.  for  any  finite  partial  play  all  of  whose  moves  (or  equivalently, 
just  //’ s moves)  are  less  than  the  strategic  response  according  to  a is  less  than 
w(ri).  Then 

C = {p  < to\  | p is  an  infinite  limit  A Wr/  < p(w{p)  < p)} 

is  closed  unbounded  in  to \,  and  C c Y:  Suppose  that  p e C,  and  let  [rjj  | i e 
to}  C.  p be,  unbounded  in  p.  Then  in  the  play  a * (p,-  | i e to)  each  move  is  less 
than  p,  and  since  o is  winning,  p e Y. 

Solovay’s  idea  was  to  simulate  this  game  by  one  to  which  AD  is  applicable. 
Consider  for  Y c.  a>\  a game  where  the  players  choose  integers: 

I : x (0)  v (2) 

// : x(l)  x(3) 

Recall  from  §13  that  each  y ewa>  encodes  a sequence  ((y),-  | i e to)  e m(fto)  and 
each  z e 01  to  encodes  a relation  Ez  c.  cox  to  with  ||z||  denoting  its  ordertype  when  it 
is  a well-ordering.  I is  regarded  as  playing  a sequence  of  relations  (£(.V/),  | i e to) 
and  II,  {E(Xll).  \ i e to).  Suppose  first  that  for  some  i e to,  either  E(xi)i  or 
E(Xu ).  is  not  a well-ordering;  then  for  the  least  such  i,  II  wins  if  EiX]).  is  not  a 
well-ordering,  and  otherwise  I wins.  Assuming  that  this  did  not  happen,  I wins 
if 

sup({ ||  (x/)/ 1|  | i e to}  U {||  (*//)/ II  I i e «})  e Y , 
and  otherwise  II  wins. 

This  game  can  be  construed  as  a game  GW(A)  for  some  A c “ to,  and  so  is 
determined.  We  argue  that,  as  before,  if  I has  a winning  strategy  <7,  then  Y e C ^ 
(and  the  argument  will  show  that  if  II  has  a winning  strategy,  then  u>\  — Y eCWl) 
thus  completing  the  proof  by  confirming  that  C ^ is  an  ultrafilter. 

Considering  the  previous  argument,  it  will  suffice  to  get  bounds  in  the  fol- 
lowing sense:  For  each  77  < <a>i  , there  is  a /3  < oo\  such  that:  for  any  n e to  and 
y e “ to,  if  for  each  i < n,  E(y).(=  E((a*y)u).)  is  a well-ordering  with  ||(y);||  < 77, 
then  £’((ct*j’),)„  is  a well-ordering  with  ||((er*y)/)„||  < /S.  So,  suppose  that  p e co\. 
Set 
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A,,  = {((<7  * y)/)„  I new  a a 

V/  < n(E(y).  is  a well-ordering  with  || (>>),■  ||  < ?;)}  . 

Then  A,,  is  Hj  in  a real  coding  the  ordertype  r),  since  it  can  be  defined  in  terms 
of  embeddability  into  that  ordertype  (cf.  the  second  proof  of  13.4).  Moreover,  in 
terms  of  13.6,  A,,  C WF  as  a is  winning,  and  so  A,  C WF^  for  some  f < co\. 
Flence,  a /I  as  desired  has  indeed  been  found.  Taking  wfq)  to  be  the  least  such 
P,  the  proof  can  now  be  completed  with  the  defined  function  w:  a>\  — >■  a>\  as 
anticipated.  H 

For  the  foregoing  and  the  forthcoming,  note  again  for  provability  in  ZF  that 
AD  implies  AC0J(0J oj)  (27.10),  which  suffices  for  the  results  of  §13.  That  CWx  itself 
witnesses  the  measurability  of  an  turned  out  to  be  a substantial  hypothesis.  In 
the  mid-1970’s  Martin  and  Mitchell  (see  their  [79])  applied  Kunen’s  methods  for 
2 1 .25  to  conclude  from  the  hypothesis  that  for  every  X there  is  an  inner  model  of 
ZFC  with  X measurable  cardinals.  By  the  early  1980’s  the  hypothesis  was  gauged 
as  considerably  stronger  (see  Radin  Forcing  in  volume  II). 

Martin  provided  a proof  of  Solovay’s  theorem  via  a nice  observation  of 
independent  significance.  Like  Solovay,  Martin  made  crucial  contributions  in  this 
area  as  well  as  in  descriptive  set  theory  (§§29,30),  and  moreover  was  to  forge  the 
initiative  for  establishing  the  consistency  of  determinacy  hypotheses  (§31).  Some 
well-known  concepts  and  notation  are  first  affirmed:  For  a,  b e m a>, 

a <t  b iff  a is  recursive  in  b , 
a =t  b iff  a <j  b A b <j  a , and 
[a]t  — {c  6 wa>  | c =x  a]  ■ 

<t  is  the  well-known  relation  of  Turing  reducibility,  and  =t  that  of  Turing  equiv- 
alence, with  the  corresponding  equivalence  classes  [a]t  being  the  Turing  degrees. 
Every  a e ® a>  has  countably  many  <T-predecessors;  in  terms  of  §12  there  are 
countably  many  formulas  to  consider.  Letting  VT  be  the  set  of  such  degrees, 

T) x = {[a]r  | a € wa>}  , 

<t  lifts  to  an  ordering  on  Vj  well-defined  by: 

do  < d\  iff  3a3b(do  = [a]t  a d\  = [b[ t a a <t  b)  . 

Martin  considered  the  filter  Mj  over  T>j  generated  by  the  “cones”  of  degrees: 

X e Mt  iff  e VT({d  e VT  \ d0  < d}  c X)  . 

That  it  is  indeed  a filter  is  simple  to  see,  since  for  any  a,  b e w a>  there  is  always 
a c e “&>  such  that  a <T  c and  b <x  c.  Generalizing  this,  AC^,  implies  that  Mj  is 
an  -complete.  On  the  other  hand,  AC  implies  that  Mt  is  not  an  ultrafilter  (cf.  the 
proof  of  11.4(a),  noting  that  every  Turing  degree  is  a countable  set). 
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28.4  Proposition  (Martin  [68]).  Assume  AD.  Then  MT  is  an  ultrafilter. 

Proof.  Suppose  that  X C 2?x,  and  consider  the  game  Gffl( UZ).  Assume  first 
that  I has  a winning  strategy  o in  this  game.  Considering  o as  a member  of  u,a> 
(through  a recursive  coding  of  <<Mcw)  set  d0  = [a]T.  Then  for  any  d e VT  with 
do  < d and  y e d,  clearly  a * y e d and  hence  d e X.  A similar  argument 
shows  that  if  II  has  a winning  strategy  r,  then  {d  e Vj  \ [r]T  < d}  c Vj  — X. 

H 


With  AD,  that  MT  is  an  ultrafilter  already  implies  that  it  is  u>\  -complete  by 

28.1. 

Second  Proof  of  28.2.  For  a,  b e a,w  with  a =j  b,  aq  and  this  is  a 

countable  ordinal  (27.13).  Thus,  f:  VT  ->  a>\  given  by: 

/([«]  t)  = co\[a] 


is  well-defined.  With  28.4, 

f,(MT)  = {X  c an  | f~\X)  e Mt} 

is  then  an  u>\  -complete  ultrafilter  over  a>\.  (To  check  that  it  is  non-principal,  note 
that  for  any  a < co \ and  a ewa>  there  is  a Z?  e with  a <T  b such  that  a < co^h\ 
so  that  /“'({a})  ^ Mj.)  H 

There  is  such  an  / so  that  /*(MT)  = CWl;  the  proof  of  28.20  provides  a 
general  result  along  these  lines. 

Solovay  in  the  meanwhile  had  established  a structure  result  important  for 
later  work,  playing  the  first  of  what  came  to  be  known  as  a Solovay  game , a game 
where  the  players  must  again  play  codes  for  countable  ordinals,  but  where  the 
rules  preclude  I from  having  a winning  strategy. 

28.5  Theorem  (Solovay).  Assume  AD.  Suppose  that  Y C o> \ . Then  there  is  an 
a e wa>  such  that  Y e L[a\ 

Proof.  Consider  a game  where  the  players  choose  integers: 

I : v(0)  x(2) 

II:  jc(1)  *(3) 

The  rules  this  time  are  that  II  wins  if  either  EX]  is  not  a well-ordering,  or  else 
E(Xn)i  is  a well-ordering  for  every  i e a,co  and  [|| (*//);  ||  | i e co}  — Y fi  f>  for 
some  f > ||x/||.  Otherwise,  I wins. 

Using  a bounding  argument  as  in  Solovay’s  previous  proof,  it  is  first  estab- 
lished that  I cannot  have  a winning  strategy:  Assuming  that  a is  such  a strategy, 
{(a  * y)i  | ye  “&>}  is  a subset  of  WF.  Hence,  there  is  a ft  < a>\  such  that 
for  any  y e mu>,  ||(er  * y)/||  < ft.  Consequently,  for  any  y e ma>  with  E(y).  a 
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well-ordering  for  every  i e co  and  {||(y),  ||  | i e co}  = Y fl  fl,  a * y is  a play  that 
II  wins  - a contradiction. 

It  follows  that  II  must  have  a winning  strategy,  say  r.  The  proof  is  completed 
by  showing  that  Y e L[r]  (regarding  r as  a real):  Note  first  that  there  is  a simple 
formula  <p(vo,  v\)  such  that  for  any  z € ®o>  with  Ez  a well-ordering, 

Ikll  e T iff  <p[z,  r]  . 

As  r is  winning,  cp  can  assert  that  there  is  an  i e co  and  a real  coding  an  isomor- 
phism between  {co,  Ez)  and  {co,  T)„),}-  Now  if  co ii[rl  = an,  then  definability 
via  such  a cp  and  absoluteness  would  imply  that  Y e L[t],  However,  co\  is 
inaccessible  in  L[r]  by  27.13,  but  a forcing  argument  can  be  applied: 

For  each  f < co\,  let  P|  be  the  p.o.  for  adjoining  a surjection  from  co  onto 
§ via  finite  conditions,  and  let  z.%  be  a P^-name,  defined  uniformly  in  £,  for  a 
z e “co  such  that  Ez  has  ordertype  f in  any  generic  extension  via  P^ . Then  for  <p 
as  above, 

HeY  iff  L[  r]|=(b%  <Pizf,i))  ■ 

For  the  right-to-left  direction  note  that  V{P$)  fl  L[r]  is  countable  by  the  inacces- 
sibility of  cd i in  L[ r],  and  so  there  are  P^-generics  over  P[r],  It  follows  by  this 
definability  (and  an  application  of  the  ZF  Reflection  Principle)  that  I 6 I[r],  H 

Kechris  [88b]  has  recent  results  on  subsets  of  co\  so  constructive  from  a real. 
Upon  seeing  Martin’s  result  28.4  Solovay  raised  the  ante  in  1968  with  the 
following  result: 

28.6  Theorem  (Solovay).  Assume  AD.  Then  coi  is  measurable. 

Proof.  The  argument  starts  out  as  for  the  second  proof  of  28.2  and  then  has 
features  of  the  proof  of  28.5.  For  a.b  ema>  with  a =T  b,  (<ui)+L[fl]  = (w\)+L^, 
i.e.  the  respective  successors  in  L[a]  and  in  L[b]  of  (the  real)  co\  are  equal,  and  this 
ordinal  is  less  than  (the  real)  002  by  the  existence  of  sharps.  Thus,  f:T> t — >■  002 
given  by 

/(Mr)  = («i)+L[fl] 

is  well-defined.  Using  28.4, 

/*(Mt)  = {X  c <02  | f~\x)  e Mt} 

is  then  an  co\  -complete  ultrafilter  over  C02.  (To  check  that  it  is  non-principal,  note 
that  for  any  a < 002  and  a e coco,  if  p:  a -»  co\  is  an  injection,  then  by  28.5  there  is 
a b e mca  such  that  p e L[b],  and  taking  a <T  b implies  that  /_1({fl})  ^ /*(MT).) 
To  complete  the  proof,  it  suffices  to  verify  ^-completeness: 

Suppose  then  that  {2^  | ^ < cof  c /*(Mj)  where  it  can  assumed  that 
Xrj  C Xj:  for  | < p < coi  by  an -completeness.  It  must  be  shown  that  e 

To  this  end,  consider  the  following  Solovay  game: 
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I : x (0)  x (2) 

II:  x(l)  x(3) 

II  wins  if  either  EXI  is  not  a well-ordering,  or  else  {d  e VT  | [x//]T  < d}  c 

A bounding  argument  as  for  28.5  shows  that  I cannot  have  a winning  strategy: 
Assuming  that  a is  such  a strategy,  {(a  * y)i  \ y e “»}  is  a L]  subset  of  WF, 
and  so  there  is  a f < co\  such  that  ||(er  * y)/||  < /3  for  every  y e 01  co.  Since 
Xp  e /*(Mt),  there  is  a be  coco  such  that  {d  e Vj  | [b]T  < d]  c f~x{Xp).  But 
then,  as  f < /I  implies  that  f~l(Xp)  C f~1(X p),  it  follows  that  a * b is  a play 
that  I loses  - a contradiction. 

It  follows  that  II  must  have  a winning  strategy,  say  r.  Regarding  r as  a real, 
the  proof  is  completed  by  showing  that  {d  e Vj  | [r]x  < d}  c /_1(p \^<m  X j): 

Suppose  that  § < and  x e with  r <T  x;  it  must  be  shown  that 
/([x]T)  e X$.  As  in  the  proof  of  28.5  let  P 'j  be  the  p.o.  for  adjoining  a surjection 
from  co  onto  f via  finite  conditions.  Since  V(P%)  D L[x]  is  countable  by  the 
inaccessibility  of  co\  in  L[x],  there  is  a g P^-generic  over  L[x].  g can  be  regarded 
as  a real  such  that  ||g||  = p Let  y e 01  co  be  such  that  x <j  g <t 
and  L[x][g]  = P[y].  Since  forcing  with  P^  preserves  all  cardinals  greater  than 
/([x]T)  = /([v]t).  Now  r is  winning  for  II,  so  if  h — (g  * r )//,  then 
{<f  e 2?t  | [/z]T  < d]  c /_1(X|).  But  it  follows  from  r <T  x and  the  choice  of  y 
that  h <T  y.  Hence,  /([x]T)  = /([v]t)  6%  H 

With  an  apparent  trend  set,  quite  unexpected  was  the  next  advance:  Martin 
in  1970  established  that  under  AD  the  con’s  for  3 < n < oo  are  singular  with 
cofinality  co2.  It  is  a brave  new  world  that  has  such  properties!  Martin’s  result 
was  a corollary  to  his  incisive  analysis  of  £3  sets  under  AD.  Setting  the  stage, 
recall  from  §14  that  under  the  assumption  Vu  e wco(a#  exists),  u j denotes  the  fth 
uniform  indiscernible,  so  that  u\  — co \ and  up  < cop  for  | > 0. 

AD  implies  u2  — a>i  ■ 

This  follows  from  the  argument  for  28.6,  which  can  be  viewed  as  showing  that  u2 
is  a measurable  cardinal;  put  another  way,  co2  — sup(ran(/))  < u2,  the  equality 
by  the  ^-completeness  of  /*(MT)  and  the  inequality  by  the  definition  of  /. 

The  following  is  established,  working  with  £3  sets  and  assuming  a coding 
lemma,  as  30.18: 

28.7  Theorem  (Martin).  Assume  AD.  Then  u„,  = co,,,. 

With  the  regularity  of  co2  from  28.6  and  the  above  observation  that  u2  — co2, 
Martin’s  corollary  follows: 

28.8  Corollary.  Assume  AD.  Then  cf (co„)  = co2for  every  2 < n < co. 

Proof.  For  2 < n < co,  co„  = Uk  for  some  k,  which  by  the  theorem  satisfies 
1 < k < co.  But  then  by  Solovay’s  14.18(a),  cf(&>„)  = cf(«2)  = co2. 
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28.7  in  fact  implies  that  the  first  u>  infinite  cardinals  are  exactly  the  first  w 
uniform  indiscemibles: 

28.9  Theorem  (DC)(Kunen;  Solovay).  Assume  AD.  Then  for  I < n < co,  un  = co„. 

Proof.  Ultrapowers  of  sets  (regarded  as  structures  for  Ce)  via  co\ -complete  ultra- 
filters lack  infinite  descending  chains,  and  so  DC  ensures  that  such  ultrapowers  are 
well-founded.  In  particular,  such  ultrapowers  of  ordinals  are  well-ordered  under 
the  induced  ordering.  The  following  will  be  established: 

(*)  For  1 < n < co,  0J'  unjCm  has  ordertype  un+\  . 

Granted  this,  if  for  some  such  n,  \un\  = \un+\\,  then 

\un+i\  = PiG+i/CJ  = run/Cm\  = |m„+i| 

and  recursively  |m,!+!|  = \u„\  for  every  i e co,  so  that  \um\  = \un\  < con,  contra- 
dicting 28.7.  ft  follows  that  \un\  < | | for  1 < n < co,  and  this  together  with 
u n < con  for  1 < n < co  imply  the  desired  conclusion. 

Toward  establishing  (*),  it  is  first  observed  that  for  £ < um,  C L where 
L — Uae^a,  £[fl]  (as  defined  before  15.8).  For  f < a>2  this  follows  from  28.5.  For 
a>2  < | < uai,  proceeding  by  induction  on  %,  assume  first  that  for  some  a e mco,  t; 
is  not  a cardinal  in  L[a],  so  that  there  is  a bijection  g:  § ->  £ for  some  £ < £ with 
g e L[a],  Then  for  any  / e g o / e L[b]  for  some  b e “ co  by  induction,  and 
hence  f e L[c]  for  any  c e coding  a and  b.  Now  assume  that  £ is  a cardinal 
in  L[fl]  for  every  a e “&>.  Then  £ is  a uniform  indiscernible:  If  for  some  b e “<w, 
£ were  strictly  between  two  consecutive  members  of  the  corresponding  class  of 
indiscemibles  I *,  then  in  L[b#]  these  indiscemibles  are  of  the  same  cardinality, 
and  hence  f would  not  be  a cardinal  there.  With  a>2  < £ < um,  f = un  with 
2 < n < co  , and  so  it  follows  by  14.18(a)  that  cf(£)  = cf(/<2)  = co 2-  But  then, 
Wl£  = W1£,  and  so  the  result  follows  by  induction. 

Finally,  to  establish  (*)  the  argument  for  15.11  is  used.  Suppose  first  that 
h e 011  un-  Then  ft  e Z by  our  observation.  It  follows  from  14.18(b)  that  there  is 
an  a e “co  and  a Skolem  term  t\  such  that 

ft  = f1<L[a],e,a>(w1 u„ ) . 

A small  adjustment  leads  to  a Skolem  term  t such  that  for  any  £ < co\  (=  u\), 

h(i;)  = t<L[a]’e’a)(i;,UU...,Un ) . 

Now  consider 

ft (h , a,  t)  = t{Lla]-€’a)(uu  . . . , un+ 1)  < h„+1 

following  the  15.11  proof;  as  there,  if  ft  e m'un  with  {§  < &>i  | ft(£)  = ft(f)}  e Cm 
and  ft  (ft,  a,  I)  is  derived  from  some  a and  7 for  ft,  then 

ft(ft,  a,  t)  = rj(h,  a , t ) . 
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Hence,  the  map 

e.  un/C'a)\  ^ Wn- t-i 

given  by  e(\ji\)  — t](h , a,  t)  for  some  a and  t for  h is  an  order-preserving  bijection. 
This  completes  the  proof.  H 

This  synthetic  result  provided  a structural  description  of  the  first  u>  uncount- 
able cardinals  under  AD  through  the  theory  of  sharps,  and  accounted  for  the 
apparent  pathology  of  cf (<w„)  = a> 2 for  n > 3 through  Solovay’s  14.18(a). 

The  investigations  of  the  combinatorial  and  descriptive  set  theory  conse- 
quences of  AD  were  both  to  provide  a context  for  and  be  advanced  by  an  initia- 
tive from  a different  quarter,  the  study  of  strong  partition  relations  in  infinitary 
combinatorics. 

Infinite  Exponent  Partition  Relations 

Eugene  Kleinberg,  a student  of  Martin,  pursued  the  consequences  of  infinite  ex- 
ponent partition  relations.  The  Ramsey  property  for  sets  of  reals,  discussed  at 
the  ends  of  §§11,27,  evolved  from  the  possibility  00  — > (&>)“•  Although  such 
relations  contradict  AC  (7.1),  early  on  in  1968  Kleinberg  argued  for  their  intrinsic 
interest  at  uncountable  cardinals  and  showed  that  versions  imply  measurability  in 
a strong  sense  through  canonical  ultrafilters. 

For  X < k both  regular  let  denote  the  filter  over  k generated  by  the  sets 
L-closed  unbounded  in  k.  If  D is  such  a set,  then  letting  C consist  of  the  limit 
points  of  D other  than  k,  C C I {£  < k | cf(§)  = A}  c D.  Thus, 

CXK  = {X  c K | 3C  e CK(C  n {£  < k | cf(£)  = a)  c X)}  . 

With  AC,  through  arguments  that  choose  for  each  X e a corresponding  C eCK 
satisfying  Cfl{£  < k \ cf(§)  = X}  c X,  it  can  be  shown  that  C £ is  a k -complete,  in 
fact  normal,  filter  over  k;  on  the  other  hand,  {f  < k \ cf(£)  = 1}  can  be  partitioned 
into  disjoint  stationary  sets  by  16.9,  and  so  C £ is  not  an  ultrafilter.  Infinite  exponent 
partition  relations  contradict  AC,  but  establish  that  the  C £ witness  the  measurability 
of  k: 

28.10  Theorem  (Kleinberg  [70]).  Suppose  that  X < k,  X is  regular,  and  k — > 
(k) 2+>u.  Then  k is  measurable,  and  C £ is  a normal  ultrafilter  over  k. 

Proof.  First  observe  that  k — > (k)\  already  implies  the  regularity  of  k (by  the 
proof  of  7.6).  Next  note  that  k — > (/r)J;  for  every  y < k by  the  argument  for 
7.14(c): 

Suppose  that  /:  [at]a  — > y where  y < k.  Define  g:  [v]A-+;‘  — >■  2 by:  g(s)  = 0 
iff  /(s0)  = where  so,  si  e [s]x,  so  U si  = s,  and  Uso  < flsi.  Let  H e [/r]^ 

be  homogeneous  for  g.  Since  X ■ k — k > y,  there  must  be  to,  t\  e [//]*  such 
that  Uto  < flfi  and  /(to)  = /(ti).  Thus,  gito  U t\)  — 0,  and  so  by  homogeneity 
g“\H\+f  = {0}.  Now  for  any  u,v  e [//]A  there  is  by  the  regularity  of  k a 
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w e [//]A  satisfying  U u.  Uv  < fi w,  so  that  f(u)  — f(w ) = f(v).  Hence,  H is 
homogeneous  for  / as  well. 

We  next  show  that  C x is  a /(-complete  ultrafilter.  For  this  purpose,  it  suffices 
to  assume  that  F:  k —*■  y where  y < k,  and  find  a fi  < y such  that  F~l  ({/})  e Cfi 
To  do  this,  define  G : [/c]x  — >■  y by:  G(s ) — a iff  F(Gs)  — a.  By  k — * (/c)x, 
let  H e [k]k  be  homogeneous  for  G.  Set 

C = {S  <K  I U(ff  n$)  = £ A Cf(£)  = A)  . 

Then  C is  A-closed  unbounded  in  k,  and  if  G“[H ]l  = {yS},  then  C c F_1({/}) 
so  that  F’_1({/})  e Cf 

We  finally  show  that  Cx  is  normal.  Noting  that  the  regressive  function  char- 
acterization 5.10  holds  in  ZF,  it  suffices  to  assume  that  / e kk  is  regressive  and 
by  /e-completeness,  to  find  an  X e Cx  and  a S e k such  that  f“X  c 8.  To  do  this, 
define  g:  [/r]x  — > 2 by:  g(s)  — 0 iff  f(Us)  < fi.v . By  k — » (k)\,  let  H e [/c]^ 
be  homogeneous  for  g.  Taking  any  s e [H]x,  let  s — s — (/(Us)  + 1).  Then 
s e [H]k  as  /(Us)  < Us,  and  /(Us)  = /(Us)  < fls.  Thus,  g(s)  — 0,  and  so  by 
homogeneity  g“[H]x  = {0}.  But  then,  if  8 is  the  least  member  of  H and 

c = {£  < k | [J(H  n ?)  = f a cf(f ) = i} 

as  before,  then  C is  k-closed  unbounded  in  k,  and  it  is  simple  to  check  that 

rc  Ci.  H 

The  Cx’s  become  focal  when  k is  not  too  inaccessible: 

28.11  Exercise  (Kleinberg  [70]).  Suppose  that  k is  not  weakly  Mahlo  and  the  Cx  s 
for  regular  X < k are  normal  ultrafilters.  Then  they  are  the  only  normal  ultrafilters 
over  k. 

Hint.  Normal  ultrafilters  over  k include  CK,  and  by  hypothesis  {£  < k \ cf(f)  < 
H)eCK.  H 

These  results  would  have  remained  curiosities  had  it  not  been  for  a succession 
of  partition  theorems  that  was  to  span  the  accessible  transfinite.  It  was  established 
in  the  early  1970’s  that  under  AD  + DC  both  co\  and  ap  satisfy  substantial  infinite 
exponent  partition  relations,  and  later,  that  many  cardinals  satisfy  such  relations. 
The  first  inkling  for  co \ was  Martin’s  groundbreaking  result  in  late  1968  that 
under  AD,  co i — > (co i)“ • Kunen  then  extended  the  conclusion  to  co\  — > (co i)“ 
for  every  a < co\.  Then  in  1971,  building  on  Solovay’s  structure  result  28.5, 
Martin  established  the  following  assertion  of  strong  dichotomy  about  oo\ : 

28.12  Theorem  (Martin).  Assume  AD.  Then  co\  — > (m\  ff . 

Proof.  For  H e [ft/i]®1,  with  h:  co\  — > H enumerating  it  in  increasing  order  set 

WH  = {sup({//(«-§  + 0 | i e co})  | | < &>i}  e [wi]"1  , 
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i.e.  WH  consists  of  the  suprema  of  a>  consecutive  members  of  //. 

Suppose  now  that  /:  [cvi]"1  ->  2.  A C closed  unbounded  in  co\  will  be 
found  so  that  WC  is  homogeneous  for  /.  C will  consist  of  the  closure  ordinals 
provided  by  a Lj  bounding  argument  applied  to  a winning  strategy  for  a certain 
game  associated  with  /.  By  28.5  every  Y e is  in  some  L[a]  with  a e "cu 

and  so  can  be  anticipated  using  sharps  and  Skolem  terms. 

Toward  defining  the  game,  we  first  review  and  tailor  the  §§9, 14  sharp  theory 
relativized  to  reals.  An  EM  blueprint  for  a e “co  is  the  theory  of  some  structure 
(La[a],  e,  a,  Sk)kea>  where  8 is  a limit  ordinal  > co  and  {&-  | k e co]  is  a set  of 
ordinal  indiscemibles  for  {L&\a],  e,a)  indexed  in  increasing  order.  Here,  such  a 
theory  is  identified  with  some  z € “co  through  some  arithmetization  (so  that  a can 
be  recovered  from  z),  and  A4(z,  a)  denotes  the  model  unique  up  to  isomorphism 
generated  by  indiscemibles  in  ordertype  a (cf.  9.4).  For  z e “co, 

3a  e “co(z  = a#)  **  S(z ) A W(z) 

where  S is  arithmetical,  asserting  that  z codes  an  EM  blueprint  for  some  a contain- 
ing sentences  for  satisfying  the  §9  conditions  (I)  and  (III),  and  W is  n\ , asserting 
that  A i(z,  a)  is  well-founded  for  every  a < oo\  (cf.  14.11). 

When  S(z),  whether  M(z,  a)  is  well-founded  or  not,  it  has  a largest  initial 
segment  of  its  “ordinals”  which  is  actually  well-ordered.  Identifying  members  of 
this  initial  segment  with  corresponding  bona  fide  ordinals,  temporarily  write 

j]  <is  M(z,a) 

to  indicate  that  is  a member  of  the  initial  segment. 

Let  (tn  | n e co)  enumerate  the  Skolem  terms  (like  those  described  after  9.3, 
but)  for  £e(A)  and  in  one  free  variable.  For  z e 10 co,  let  z+  e “co  be  defined  by 
z+(i)  = z(i  + 1),  so  that  z = (z(0))"z+.  Now  define  for  a < a>\: 

Aa  — [z  e “co  | S(z+ ) A a <is  M(z+ , a + co)  A 

3??  > cx{t^z+ 'a+to) {a)  = rj  <is  M(z+,a  + co))}  . 

The  significance  of  these  sets  becomes  clear  through  their  role  in  the  following 
game  associated  with  /:  [wi]"1  2: 

The  players  choose  integers  as  usual: 

I : x (0)  x(2) 

II:  x(l)  *(3) 

Here,  I is  regarded  as  playing  an  integer  x(0)  and  a real  (x/)+,  and  II  as  playing 
an  integer  x(l)  and  a real  (xn)+.  There  are  two  cases  in  the  payoff  scheme: 

(i)  There  is  an  a such  that  either  x\  Aa  or  xu  £ Aa.  Then  for  the 
least  such  a,  II  wins  if  xj  £ Aa,  and  otherwise  / wins. 
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This  case  leads  to  the  crucial  bounding  argument.  Loosely  speaking,  a player 
wins  if  his  real  satisfies  S and  generates  structures  with  well-founded  parts  “more 
closed”  under  his  Skolem  term. 

Suppose  that  z e for  every  a < co\.  Then  since  a <jS  M(z+,  a + co)  for 
such  a,  all  the  M(z+ , fiY s are  well-founded  as  they  are  embeddable  into  each 
other,  and  so  there  is  an  a such  that  z+  = a#.  Moreover,  as  M(z+,  a + co)  -< 
(L[a],  e,  a),  a < (a)  < oo\  for  every  a < co\.  These  remarks  lead  to  the 

other  case: 

(ii)  ( xi)+  = a#  and  (v//)+  = b#  for  some  a,b  e wco.  Then  defining 
Px-  o)\  -»  an  by 

Px(H)  = sup({t^Q)fl]’e’ff> iwfi  + i)  | i e a)}  U {t[L$le'b) (cofi  + i)  | i e w})  , 

I wins  if  f(r&n{px))  = 0 and  II  wins  if  /(ran(px))  = 1. 

It  will  be  argued  that  a winning  strategy  for  I provides  a requisite  homogeneous 
set  with  / value  0 (and  the  argument  will  show  that  a winning  strategy  for  II 
provides  a requisite  homogeneous  set  with  / value  1). 

Suppose  then  that  a is  a winning  strategy  for  I.  As  anticipated,  a L[  bounding 
argument  is  first  applied  to  define  a closed  unbounded  subset  of  &>i  (using  the 
notations  Er,  ||r||,  and  WF  as  e.g.  in  the  first  proof  of  28.2).  Define  for  rj  < w\\ 

Bn  — {r  e wcti  | 3z  e a>cti3o/  < ri(S(z+ ) A a <is  M. (z+ , a + oo) 

A t^z+,a+ro\a)  = rj  <is  M(z+ , a + a>) 

A Er  has  ordertype  ’"+<u)(o:)}  . 

Then  Bn  is  Sj  in  a real  coding  the  ordertype  The  various  assertions  about 
ordinals  can  be  cast  in  terms  of  reals  coding  them  and  embeddings  (cf.  the  second 
proof  of  13.4).  Note  that  since  a is  winning  for  I and  the  conditions  on  z ensure 
that  z e Aa,  ^<fT*z>,>  £'+'u)(a:)  is  a bona  fide  ordinal  by  clause  (i).  This  implies 
that  Bv  C WF,  so  that  as  in  previous  arguments  a function  w:  co \ — >■  co\  can  be 
defined  such  that  if  r e Bv,  then  ||r||  < w(p).  Set 

C = {y  < a. u | Vrj  < y(w(rj)  < y)}  . 

Then  C is  closed  unbounded  in  w \,  and  the  proof  is  completed  by  showing  that 
coC  is  a homogeneous  set  for  / with  value  0. 

Suppose  that  p:  a>\  ->  mC  is  increasing;  it  must  be  shown  that  /(ran( p))  — 0. 
By  definition  of  WC  there  is  an  increasing  function  q:  co \ — >■  C such  that  for  any 

I < 0)\, 

P(%)  = sup({g(tt>-|  + i)  | i e co})  . 

q e L[b]  for  some  b e OJco  by  28.5,  and  in  fact  the  last  paragraph  of  its  proof 
provides  a specific  definition  via  forcing  so  that  for  some  k e co, 

Vff  < coi(t{kL[h]'e'h)  (a)  — q(a))  . 
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Set  z — (k)~b#.  Then  since  a is  winning  for  /,  (er  * z)i  = (cr(0))''a#  for  some 
a e 10 co.  For  pa*z  as  in  the  statement  of  (ii)  it  now  suffices  to  show  that  p — pa*z, 
for  then  /(ran(/;>))  = /(ran  (/?„,.))  = 0 as  a is  winning  for  I. 

Suppose  then  that  f < co\.  For  each  i e co, 

co-i;  + i < tlL[bl€'b)  (co-%  + i)  = t^(b*'w^+l)\co^  + i ) , and 

$ro],6,“Vs  + 0 = C)a#,a,'(*+1)V£  + *') 

as  M{b# , &>•(/  + 1))  -<  ( L\b ],  e,  b),  and  similarly  for  a.  Hence, 

+ i)  < w(q (co-t;  + 0)  < p(H)  , 

the  first  inequality  by  the  choice  of  k and  the  definitions  of  Bq^+t)  and  w, 
and  the  second,  as  «(&>/  + i)  < p($)  and  p(t; ) e C.  Thus  /;(§),  being  the 
supremum  of  the  t[L^'e'h\co-%  + /)’ s,  equals  pa*z(H),  which  is  the  supremum  of 
the  + 0’s  and  tj.L^'e'h\co-^  + z)’s.  This  completes  the  proof  of  the 

theorem.  H 

This  proof  was  to  be  generalized  to  higher  cardinals  using  analogues  of  the 
£}  bounding  argument  and  the  coding  of  functions:  u>\  — »■  a>\  by  reals  (30.26). 
The  proof  admits  a stronger  conclusion: 

28.13  Exercise.  Assume  AD.  Then  co\  — > (cui)^!,,.  an d so  0Ji  — > (fib)®1  for 
every  a < u>\. 

Hint.  Suppose  that  /:  [co\\m  — »■  "2.  For  each  n e co  let  /„:  [<ui]Wl  ->  2 be  defined 
by  fn(s)  — /(s)(n),  and  by  the  proof  of  28.12  let  Cn  be  closed  unbounded  in  a>\ 
so  that  MC„  is  homogeneous  for  /„.  Then  C„)  is  homogeneous  for  /. 

For  the  latter  assertion,  note  that  any  countable  ordinal  is  injectible  into  coa>. 

H 


The  function  /:  [u>i]<Ul  -*  <o\  defined  by  f (s)  — fly  is  a counterexample  to 
co\  — » {(Oi)l\,  but  recall  that  co\  / 2K°  under  AD. 

Martin  had  also  shown  by  then,  using  the  emerging  structure  theory  for  sets 
of  reals  (§§29,30),  that  under  AD  + DC  there  are  many  cardinals  k satisfying 
Va  < w\(k  — » (/c)“). 

In  the  next  advance,  Martin  and  Paris  built  on  28.12  by  taking  an  ultrapower 
by  Co, j to  establish  that  under  AD  + DC,  Va  < 002(0)2  — » (&>2)“)-  As  28.12 
did  for  <o\,  this  result  considerably  strengthened  the  result  that  002  is  measurable, 
and  implied  through  28.10  and  28.11  that  there  are  exactly  two  normal  ultrafilters 
over  002,  C ^ and  On  the  other  hand,  Kunen  showed  that  ^—^(coT)^.  (See 
Kechris  [78]  for  these  various  results.) 

Kleinberg  showed  in  1975  that  many  of  these  results  under  AD  are  actually 
consequences  solely  of  Martin’s  co\  — > Owi)-?1-  Fie  emphasized  the  generality, 
and  the  Jonsson  and  Rowbottom  partition  properties: 
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28.14  Theorem  (DC)(Kleinberg  [77,77a]).  Suppose  that  k > co  and  k — > (k)2. 
Then  there  are  cardinals  k — K\  < k2  < K3  < ■ ■ . such  that: 

(a)  K2  as  well  as  k\  are  measurable,  and  moreover  K2  — >■  (*2)2  for  every 
a < o>i. 

(b)  Kn  for  n > 3 are  singular  Jdnsson  cardinals  of  cofinality  K2  and 
sup({/c„  | n e ft>})  is  ci  Rowbottom  cardinal. 

Moreover,  if  there  is  a normal  ultrafilter  U over  k such  that  kk/U  has  ordertype 
k+,  then: 

(c)  Kn+ 1 = k+  for  n > 1. 

(d)  K2  — > (*2)2  for  every  a < k2. 

(e)  Every  normal  ultrafilter  over  k\  or  K2  is  of  form  C*. 

For  k = co\  under  AD,  (c)  - (e)  apply  since  0>'co\  /C(U]  has  ordertype  U2  — co2 
by  (*)  in  the  proof  of  28.9.  Some  surface  remarks  are  made  about  the  web 
of  connections  described  by  28.14:  (b)  and  (c)  provide  a purely  combinatorial 
explication  of  cf (&>„)  = co 2 for  n > 2 (28.8).  The  arguments  for  (a)  and  (b)  are 
implicit  in  the  Martin-Paris  result  of  the  special  case  a> 2 — > (0)2)“  for  every  a < 
co 2.  Generalizing  co2—/^(co2)22,  Henle  [79]  showed  that  k2— delimiting 
(d).  As  described  in  Kleinberg  [77a]  the  proof  of  (b)  was  simplified  by  another 
result  of  Henle,  one  that  generalizes  28.13:  If  k — > (k) 2,  then  k — > (k)2» 0- 
Also,  Everett  Bull  [78: 171]  showed  that  the  existence  of  a U as  described  implies 
that  k is  a successor  cardinal,  so  that  (e)  follows  from  28.11. 

The  combinatorial  consequences  of  k — > (, k)2  and  its  variants  were  further 
studied  by  Henle  [77, 79, 84],  Kleinberg  [81, 82],  and  Henle-Kleinberg-Watro  [84], 
As  for  consequences  of  AD  along  these  lines,  it  was  known  by  1977  that  AD 
implies  that  there  are  many  cardinals  k such  that  k — >■  (k)2.  (However,  not  until 
the  work  of  Jackson  [88]  did  it  become  known  that  there  are  such  cardinals  greater 
than  &>i  that  satisfy  the  “Moreover,  ...”  clause  of  28.14.)  Such  results  about  AD 
and  infinite  exponent  partition  relations  are  discussed  toward  the  end  of  §30. 

On  0 

With  AD  seen  to  cast  a long  shadow  over  the  transfinite,  we  next  turn  to  the  early 
analysis  of  its  extent.  In  gauging  the  continuum  with  respect  to  the  ordinals  the 
consequence  a>\  f 2K°  of  AD  implies  that  the  continuum  is  small  with  respect 
to  possibilities  for  injections:  | — »■  wco.  On  the  other  hand,  there  is  always  a 
surjection:  “ co  —>  00 1 (sending  reals  coding  well-ordered  sets  into  their  ordertypes), 
and  the  length  of  the  continuum  can  be  considered  in  terms  of  the  possibilities  for 
surjections:  wco  —>  f . Moschovakis  [70]  drew  attention  to  the  following  ordinal: 

0 — sup({§  | there  is  a surjection:  “co  -*  £})  . 

A surjection:  <°co  — »■  § amounts  to  a stratification  of  wco  into  well-ordered  layers, 
and  so  corresponds  to  a well-ordering  of  some  subset  of  Vl^co).  It  follows  by 
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Replacement  that  in  ZF,  0 is  indeed  an  ordinal.  Moreover,  it  is  readily  seen  that 
in  ZF,  0 is  a cardinal  > a>2,  and  that  in  ZFC,  0 — (2K°)+.  The  determination 
of  0 in  the  sequence  of  alephs  can  be  regarded  as  a version  of  the  Continuum 
Problem  in  settings  without  AC. 

0 came  to  be  seen  in  various  ways  as  the  upper  bound  on  the  effect  of  AD 
on  the  transfinite,  and  0 under  AD  also  turns  out  to  be  very  large.  The  first  results 
along  these  lines  were  established  independently  by  H.  Friedman  and  Moschovakis 
in  1968.  The  latter  soon  isolated  his  fundamental  Coding  Lemma,  which  figures 
crucially  in  many  results  concerning  0 and  its  definability  analogues  (see  before 
30.16).  The  proof  of  the  lemma  used  what  came  to  be  known  as  a Friedman 
game , a game  where  the  players  must  play  codes  for  countable  ordinals  but  unlike 
in  a Solovay  game,  either  player  can  have  a winning  strategy.  Such  a game  was 
first  used  in  the  following  striking  result,  and  its  anticipations  by  H.  Friedman: 

28.15  Theorem  (Moschovakis  [70]).  Assume  AD.  If  there  is  a surjection:  wco  -»  a, 
then  there  is  a surjection:  “co  —>■  V{a). 

Proof.  Let  /:  ma>  —>  a be  a surjection.  A sequence  {g%  \ % < a)  will  be  recursively 
defined  so  that  each  g $ is  a surjection:  wco  ->  V(^).  Let  go:  wa>  — > "P(O)  be  the 
only  possible  map.  At  successor  stages,  define  directly  if  £ is  finite,  and  in 
terms  of  g % via  a bijection:  £ — > £ + 1 defined  uniformly  in  £.  It  remains  to 
define  g$  for  limit  <5  with  0 < 8 < a: 

First,  for  £ <8  and  y c f temporarily  stipulate  that  z e codes  y iff 
/((z) o)  = § and  g|((z)i)  = y.  Now  for  X c 8 consider  a game  Gx  where  the 
players  choose  integers: 

I : x (0)  x(2) 

II:  x{\)  x(3) 

I loses  unless  xi  codes  X fi  § for  some  f < 8,  in  which  case  II  loses  unless  xu 
codes  X D r]  for  some  ;/  with  f < rj  < 8. 

Suppose  first  that  I has  a winning  strategy  a . Then 

Vy  e “ co  ((a  * y)/  codes  X fl  § for  some  £ < 8)  , 

and  X is  the  unique  subset  of  8 with  this  property:  If  Y C 8 with  Y f X,  let 
f e Y AX  and  y e “&>  code  X fi  (£  + 1).  Then  (o  * y)j  must  code  X n§  for  some 
f > C,  and  xn%  f fn(. 

Suppose  next  that  II  has  a winning  strategy  r.  Then 

Vz  e “o)(z  codes  for  some  § < 8 -* 

(z  * r)//  codes  Afi?;  for  some  ?;  with  £ < ?;  < 8) 

and  X is  the  unique  subset  of  8 with  this  property:  If  Y C 8 with  Y f X,  let  £ 
be  the  least  member  of  Y AX  and  z e “a  code  Y n £(=  X n f ).  Then  (z  * r )// 
must  code  X n rj  for  some  ?;  > £,  and  X n i]  f Y n rj. 
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It  follows  that  for  any  strategy  for  either  player,  if  it  is  winning  in  some 
Ga,  then  that  X is  unique  as  it  can  be  reconstructed  from  the  strategy.  Thus,  the 
function  g:  {p  \ p is  a strategy  for  either  I or  II}  ->  V(S)  given  by: 

X if  p is  a winning  strategy 
g(p)  — for  either  I or  II  in  Gx  , and 

. 0 otherwise  . 

is  well-defined  and  by  AD,  surjective.  The  domain  of  g can  be  reconstituted  to 
get  a surjective  g$:  wco  ->  V(&),  completing  the  recursive  construction.  H 

There  is  a simple  corollary: 

28.16  Exercise  (H.  Friedman).  Assume  AD.  Then  © is  a limit  cardinal. 

Hint.  Suppose  that  X < 0,  and  show  that  X+  < ©\  By  the  theorem  there  is  a 
surjection:  wco  -»  V{X).  First  compose  it  with  a bijection:  V(X)  —>  V(X  x X) 
induced  say  by  the  Godel  pairing  bijection:  X -*  X x X.  Then  compose  with  that 
function  mapping  R e V{X  x X)  to  its  ordertype  if  R is  a well-ordering,  and  to  0 
otherwise.  H 

The  arguments  for  28.15  and  28.16  actually  show  that  from  a surjection: 
10 co  ->  X a surjection:  ® co  ->  A.+  can  be  defined.  A more  substantial  corollary 
follows  from  this  observation: 

28.17  Exercise  (Solovay).  Assume  AD.  Then  © — co@. 

Hint.  Suppose  that  a < 0,  and  show  that  coa  < ©'■  Using  a surjection:  — »■  a, 

recursively  define  a sequence  (gj  | £ < a)  such  that  each  is  a surjection: 

w(0  -»  COj;  . H 

Applying  his  Coding  Lemma  Moschovakis  [70]  extended  further  results  of 
Solovay  by  showing  that  under  AD  there  are  cardinals  k that  are  K-weakly  Mahlo 
below  0.  However,  it  turned  out  that  the  cofinality  of  0 may  be  small  without 
further  hypotheses.  The  following  uses  the  expected  argument,  depending  only  on 
ACW: 

28.18  Exercise  (DC).  cf(0)  > co.  H 

We  had  mentioned  L( R)  as  the  natural  inner  model  for  AD  in  the  ZFC 
context. 

28.19  Exercise  (Solovay).  Assume  V — L(Sl).  Then  0 is  regular. 

Hint.  It  suffices  to  show  ( sans  AC!)  that  there  is  a sequence  (/ij  | 0 < £ < 0 ) 
such  that  each  is  a surjection:  wa>  ->  f , for  then  the  regularity  of  0 can  be 
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established  in  the  expected  way.  To  get  such  a sequence  let  <P\  On  x 10  co  — »■  V 
be  a definable  surjection  (cf.  the  proof  of  11.13).  Define  t:  0 — {0}  ->  On  by: 
r(§)  is  the  least  ordinal  such  that  for  some  y e wco,  <£(?(£),  y)  is  a surjection: 
wco  -»  | . Then  define  h ^ — > £ for  0 < f < 0 by: 


M0HW1) 

0 


if  0(t(^),  (x)o)  is  a map  into  f , and 
otherwise  . 

H 


Thus,  although  @L(R)  < 0 may  be  a small  ordinal,  it  follows  from  ADL(R) 
that  @i(R)  is  a weakly  inaccessible  cardinal  in  L(R).  The  possibilities  for  © in 
L(K)  were  later  to  be  considerably  clarified  (30.32). 

Introducing  sharps  for  sets  of  reals  Solovay  [78a]  showed  that 

ZF  + ADr  + cf(0)  > co  b Con(ZF  + ADR)  , 

and  so  by  Godel’s  Second  Incompleteness  Theorem, 

ConfZF  + ADr)  implies  Con(ZF  + ADr  + cf(@)  = co)  . 

By  28.18,  ConfZF  + ADr)  thus  implies  ConfAD  H — 'DC),  the  independence  of 
DC  from  AD.  Steel  and  Robert  Van  Wesep  in  their  [82]  established  related  results. 

Strong  and  Supercompactness  under  AD 

This  section  is  concluded  with  combinatorial  versions  of  strong  and  supercompact- 
ness derivable  from  AD.  The  equivalences  in  terms  of  elementary  embeddings  of  V 
are  no  longer  available,  the  proof  of  the  basic  Los’s  Theorem  5.2  depending  on  AC. 
Thus  bereft,  the  strength  of  these  combinatorial  versions  is  unclear.  The  results  of 
the  late  1980’s  about  the  relative  consistency  of  AD  (§32)  were  to  indicate  that,  at 
least  for  supercompactness,  there  is  not  much  consistency  strength.  Nonetheless, 
it  is  interesting  that  AD  has  these  other  ways  of  affecting  the  transfinite. 

In  terms  of  an  early  characterization  (4.1)  of  strong  compactness,  Kunen 
observed  in  1970  that  under  AD,  co\  is  strongly  compact  up  to  the  natural  bound 
0.  The  proof  uses  Martin’s  ultrafilter  over  Turing  degrees  to  project  down  as 
before,  and  has  a further  consequence. 

28.20  Theorem  (Kunen).  Assume  AD.  Then  for  any  X < 0,  any  a>\-complete 
filter  over  X can  be  extended  to  an  co\-complete  ultrafilter  over  X. 

Proof.  By  28.15  let  g : wco  -*  V(X)  be  surjective.  Suppose  now  that  F is  an 
co\ -complete  filter  over  X.  Recalling  Martin’s  28.4  and  its  terminology,  define 
f:V x -*  A by: 

f(d)  = least  ordinal  in  p|{,?(fl)  e F \ a e mco  A [a]T  < d } . 

(Since  there  are  countably  many  a e^co  such  that  [u]T  < d,  this  intersection  is 
not  empty  by  co\  -completeness.)  Then  for  any  X e F,  with  a e “’co  such  that 
g(a)  — X,  f“{d  e Vj  | [«]t  < d]  c X.  With  this  it  is  simple  to  check  that 
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/*(Mt)  = {Xca  | r\X)eMj} 

is  an  a)\  -complete  ultrafilter  over  X extending  F.  H 

28.21  Corollary  (DC)(Kunen).  Assume  AD.  Then  for  any  X < 0, 
f(X)  — {U  | U is  an  ultrafilter  over  1} 


is  well-orderable. 

Proof.  By  the  proof  of  the  theorem,  any  ultrafilter  over  X,  being  an  -complete 
(28.1),  is  of  form  ffiMj)  for  some  /:  Vj  — > X.  It  is  simple  to  see  that  for 
any  two  functions  / and  g with  domain  Vj  if  {d  e Vj  \ f(d)  — g(r/)}  e MT, 
then  fi(Mj)  — g*(MT).  Hence,  the  map  with  domain  the  ultrapower  T>'1 X/Mj 
sending  [/]mt  to  is  a well-defined  surjection  onto  f(X).  But  DC  ensures 

that  ultrapowers  of  ordinals  via  an  -complete  ultrafilters  are  well-ordered  under  the 
induced  ordering.  Hence,  the  surjective  image  f(X)  is  also  well-orderable.  H 

Thus,  under  AD,  if  X < 0,  then  V(X)  is  not  well-orderable,  but  f>(X)  is. 
Under  AC,  for  any  k,  that  \f{n)\  — 22*  is  a classical  result  of  Pospisil  [37] 
and  Tarski  [39].  Under  AD  it  may  happen  that  \P(k)\  — k.  The  f function  on 
cardinals  less  than  0 has  been  latterly  regarded  as  a power  set  operation  (see 
Kechris  [85]).  Assuming  DC  and  a hypothesis  intermediate  between  AD  and 
ADr,  Kechris  [88a]  established  the  following  result  analogous  to  28.15:  If  there 
is  a surjection:  w a>  ->  X,  then  there  is  a surjection:  u’a>  —*■  f{X). 

Turning  to  supercompactness,  the  existence  under  AD  of  normal  ultrafilters 
over  various  VKy  was  established  in  the  latter  1970’s.  Solovay  had  proved  the 
following  in  one  of  the  early  uses  of  the  stronger  AD  : . 

28.22  Theorem  (Solovay  [78a:  §3]).  Assume  ADR.  Then  for  any  y < 0 there  is 
a normal  ultrafilter  over  V^y. 

Proof  Solovay  actually  showed  that  there  is  an  an  -complete  ultrafilter  U over 
Vwfico  that  has  the  following  fineness  and  normality  properties: 

(i)  Va  e W«([A  e V^fco  \ a e X}  e U)  . 

(ii)  If  (Aa  | a e *co)  e “aU,  then  {X  e VMfiaj  \ X e Hflex  Aa]  e U . 

Such  a U plays  a role  similar  to  Martin’s  Mj  in  Kunen’s  28.20:  For  y < 0,  let 
/:“&>-*)/  be  surjective;  then  it  is  straightforward  to  show  that 

{ Y c Vmiy  | [v  e Vtofw  \ f“x  e Y}  e U) 

is  a normal  ultrafilter  over  V„H  y . 

Toward  the  formulation  of  U,  for  A C V0)i  01  a>  consider  a game  G(A)  where 
the  players  choose  finite,  possibly  empty,  subsets  of 01  or. 
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I : s0  s2 

II:  si  «3 

II  wins  if  {Jiew  e A,  and  otherwise  I wins.  Coding  finite  subsets  of  reals  by 
reals,  ADR  implies  that  G(A)  is  determined.  Now  define  U c V(VmOJoo)  by: 

A e U iff  II  has  a winning  strategy  in  G(A)  . 

Clearly  property  (i)  obtains,  and  if  A e U and  A c B c Vwlmco,  then  B e U. 
Moreover,  if  A C VWlcoco  and  A $ U , then  V0H  OJoj  — A e U,  for  if  a is  a 
winning  strategy  for  I in  G(A),  then  the  following  is  a winning  strategy  for  II 
in  G(Vma(o  — A):  After  P s initial  move  so,  H moves  er(0),  and  after  P s next 
move  .s'2,  II  moves  cr({cr(0),  so  U S2»,  and  thereafter  according  to  a. 

We  next  show  that  U is  an  a>\ -complete  ultrafilter  by  verifying  co\ -complete- 
ness, the  argument  soon  to  be  extended  to  establish  (ii):  Suppose  that  (Am  \ m e 
co)  e WU . Consider  a game  where  /’  s initial  move  must  be  an  integer,  say  m, 
II  must  respond  with  0,  and  then  the  players’  moves  and  result  are  as  in  G(Ant). 
It  is  simple  to  see  that  it  is  II  who  must  have  a winning  strategy  in  this  new 
game,  and  that  from  it  a sequence  (r„,  | m e co)  can  be  read  off  where  each  rm 
is  a winning  strategy  for  II  in  G(Am).  (This  strategem  avoids  ACW.)  A winning 
strategy  for  II  in  Gif' \m  Am ) can  now  be  formulated: 

Recall  our  recursive  coding  of  co  x co  with  (m,  n)  e co  coding  the  ordered 
pair  of  in  and  «;  it  can  be  assumed  that  (m,  n)  < (m,  n + 1)  for  every  m,  n e co. 
For  7/’s  ith  move  (the  (2 i + 1 ) st  move  of  the  game),  if  i — (m,  n),  II  makes  the 
strategic  response  according  to  rm  to  the  partial  play  up  until  then,  but  with  all  the 
reals  played  by  either  player  after  IPs  ( m , n — l)th  move  (or  0th  move,  if  n = 0) 
taken  together  as  a single  move  by  I.  Thus,  IP s (m,  0)th,  {m . l)th,  (m,  2)th,  . . . 
moves  ensure  that  the  resulting  set  belongs  to  Am. 

(ii)  is  established  by  building  on  this  argument:  Suppose  that  (Aa  \ a e 
m co ) e “®f/.  Playing  an  auxiliary  game  as  before  but  with  /’ s initial  move  a real 
a , we  get  a sequence  (ra  | a e a’co)  such  that  each  r„  is  a winning  strategy  for 
II  in  G(Aa).  A winning  strategy  for  II  in  G({X  e VQ)I mco  \ X e f^aeX  Aa}) 
can  now  be  formulated  as  follows:  With  the  countable  set  of  A„’s  into  which  the 
union  of  the  resulting  play  must  get  only  determined  by  the  progress  of  the  game, 
II  applies  a dovetailing  procedure  to  ensure  that  whenever  a real  a occurs  in  a 
move  of  either  player,  II  eventually  applies  xa  infinitely  many  times.  As  before, 
each  application  of  ra  is  based  on  regarding  all  the  reals  played  after  the  previous 
application  taken  together  as  a single  move  of  I.  ~\ 

Whether  the  conclusion  already  follows  from  AD  is  unknown.  Martin  ob- 
served in  1975  that  in  ZF,  if  k and  k+  are  both  measurable,  then  there  is  a normal 
ultrafilter  over  Vkk+.  A normal  ultrafilter  over  k+  can  be  used  to  “glue  together” 
normal  ultrafilters  over  VKy  for  k < y < k+  canonically  defined  from  a normal 
ultrafilter  over  k (see  Di  Prisco-Henle  [78]).  It  thus  follows  from  AD  that  w\  is 
“a>2 -supercompact”,  but  also  it  was  not  difficult  to  simulate  the  game  of  28.22 
for  small  y to  draw  its  conclusion  for  such  y . In  a significant  advance  for  the 
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general  theory  Harrington-Kechris  [81]  showed  that  certain  games  played  with 
reals  coding  ordinals  uniformly  bounded  below  0 can  be  simulated  with  games 
played  with  integers,  and  so  are  determined  under  AD.  Through  this  means  they 
simulated  Solovay’s  game  to  show  that  under  AD  there  is  a normal  ultrafilter  over 
Van  8 where  8 < 0 is  a definability  analogue  of  0,  weakly  Mahlo  but  not  the 
least  such.  Becker  [81]  applied  Martin’s  ultrafilter  MT  instead  of  Solovay’s  game 
to  get  the  same  result,  and  showed  in  [81a]  that  this  approach  shows  that  under 
AD  there  is  a normal  ultrafilter  over  V0)1 8 . Becker  [81]  also  showed  that  under 
AD,  for  several  y there  is  exactly  one  normal  ultrafilter  over  Vm]  y,  and  Woodin 
[83]  extended  this  result  to  all  y covered  by  the  Solovay  result  under  ADr  and 
by  the  Harrington-Kechris  result  under  AD. 

These  various  results  as  well  as  28.7,  from  which  we  have  drawn  conclusions, 
are  very  much  part  of  the  study  of  definability  under  AD,  the  subject  of  the 
succeeding  section. 
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This  and  the  next  sections  explore  the  structural  consequences  of  determinacy  hy- 
potheses in  descriptive  set  theory,  a direction  of  investigation  inspired  by  Blackwell 
[67]  and  first  pursued  by  Moschovakis  and  Martin.  A theme  broached  in  §27  is 
broadly  developed,  that  of  regarding  determinacy  itself  as  a regularity  property 
and  investigating  the  consequences  of  Dct( A ) for  various  A,  particularly  those 
in  the  projective  hierarchy.  The  Moschovakis  text  [80]  provides  an  extensive 
account,  and  most  of  the  work  before  the  crowning  achievements  of  the  later 
1980’s  appears  in  the  proceedings  of  the  Cabal  Seminar:  Kechris-Moschovakis 
[78],  Kechris-Martin-Moschovakis  [81,83],  and  Kechris-Martin-Steel  [88]. 

The  conventions  of  §§12, 13  are  maintained,  and  as  there 

ZF  + serves  as  the  ambient  theory  for  this  section  . 

AD  implies  AC^i^co)  (27.10),  and  versions  of  this  observation  for  definable  sets 
of  reals  give  the  first  equivalence  of  (*)  before  12.4  for  definable  relations.  It  will 
follow  that  several  of  the  forthcoming  results  featuring  determinacy  hypotheses 
can  be  established  in  just  ZF,  but  this  is  not  traced.  Uses  of  DC  are  explicitly 
noted  however. 

The  focus  will  be  on  the  projective  sets,  and  PD,  Projective  Determinacy, 
will  suffice  for  the  main  development  of  this  section.  In  any  case  results  are 
stated  parsimoniously  with  determinacy  hypotheses  only  as  needed.  To  cast  the 
discussion  in  some  generality,  a projective  class  is  some  or  II,1,  for  1 < n e a>, 
or  for  the  effective  content,  (a ) or  /7;J(u)  for  some  a e wa>  restricted  to  the 
traditional  relations  c k(‘°co),  which  are  again  referred  to  as  (a)  or  /7;j(u) 
respectively.  For  r such  a class,  —<r  denotes  its  dual  class,  i.e. 

->r  = [A  | 3 k e a> (A  c k(coco)  A k(wco)  — A e T)}  , 

and  and  V1/"  those  classes  corresponding  to  the  adjunction  of  the  correspond- 
ing quantifier,  i.e. 

3 *r  = [A  | 3fi  e r(A  = PB )}  , and 

v*r  = -3 . 

For  example,  3 'll]  is  £4,  and  V1 17^ (a)  is  again  77] (a).  It  will  be  convenient  to 
have  some  standing  meta-variables  for  this  section;  as  above, 

r denotes  a projective  class  , 
and 


Ap  denotes  r fi  —>r  . 

For  example,  Avi  is  Al.  The  Moschovakis  text  [80]  formulates  results  more 

2.3 

abstractly  in  terms  of  general  classes  satisfying  closure  and  other  properties,  and 
these  generalizations  are  suggested  by  the  formulations  here. 
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Determinacy  hypotheses  for  projective  classes  and  their  duals  do  not  have  to 
be  considered  separately  by  a simple  observation: 

29.1  Lemma.  Det(F)  iff  Det(-r). 

Proof  For  any  A C set 

A~  — x | i e co  a x e A]  . 

Then  it  is  readily  seen  that  a winning  strategy  for  I in  GW(A~)  converts  to  a 
winning  strategy  for  II  in  Gw(<u&)  — A),  and  that  a winning  strategy  for  II  in 
GW(A~)  converts  to  a winning  strategy  for  I in  Gm (Mo>  — A).  But  A~  e 7"  iff 
Ae  F.  H 

Toward  giving  the  seminal  Blackwell  argument  and  beginning  the  appropriate 
structural  approach,  two  simple  properties  are  discussed  that  were  widely  studied 
in  classical  descriptive  set  theory:  For  any  A and  B,  (A*,  B*)  reduces  (A,  B)  iff 
A*  C A,  B*  C B,  A*  U B*  = A U B,  and  A n B = 0. 


r has  the  reduction  property 

iff 

for  any 

A,  B in 

r there  are  A*,  B*  in  F 

so  that 

(A*,  B*) 

reduces  (A,  B)  . 

r has  the  separation  property 

iff 

for  any 

disjoint 

A,  B c a(m®) 

with  A. 

, B in  r 

there  is  a C in  Ap 

so  that 

ACC  and  5dC  = 0. 

Simple  diagrams  illustrate  these  properties,  and  the  following  is  an  exercise  in 
propositional  logic. 

29.2  Exercise.  If  r has  the  reduction  property,  then  —<r  has  the  separation 

property.  H 

An  early  argument  using  universal  sets  was  recast  for  the  following: 

29.3  Proposition  (Novikov  [35:465],  Addison  [58]).  If  P has  the  reduction 
property,  then  r does  not  have  the  separation  property. 

Proof.  Let  U c 2(®<y ) be  universal  for  r (12.7),  and  set 

A = {{x,  y)  | ((x)o,  y)  e U}  and  B = {( x , y)  | {(x)i,  y)  e U}  . 

A and  B are  in  F,  so  let  A*  and  B*  in  r be  such  that  (A*,  B*)  reduces  (A,  B). 
Assume  to  the  contrary  that  there  is  a C in  Ap  so  that  A*  C C and  B*  (T  C = 0. 
Then  C C 2(“<y)  is  universal  for  Ap'. 

For  any  DC®®  with  D in  Ap,  let  a e 01  co  be  such  that  for  any  y e 01  co. 


y e D o-  ((o)o,  y)  e U , and  y £ D **  ((a) i,  y)  e U . 
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Then  for  such  a,  (a,  y)  e A iff  (a,  y)  B,  so  that  (a,  y)  e A*  iff  (a,  y)  f.  B*, 
and  consequently  y e D o-  (a,  y)  e C . 

However,  there  can  be  no  set  universal  for  Ar  by  Cantor’s  diagonal  argument 
(as  noted  after  12.8).  H 

29.4  Corollary.  If  F has  the  reduction  property,  then  —>F  does  not  have  the 
reduction  property. 

Proof  By  29.2  and  29.3.  H 

Luzin  [27:50ff]  formulated  and  established  the  separation  property  for 
(cf.  before  13.5).  Kuratowski  formulated  the  reduction  property  and  established 
the  following,  which  subsumes  Luzin’s  result  by  29.2.  Addison,  as  with  29.3,  later 
noted  the  effectivization.  In  a move  that  brought  on  a conceptual  shift,  Blackwell 
provided  a proof  based  on  the  determinacy  of  open  games: 

29.5  Theorem  (Kuratowski  [36],  Addison  [58]).  For  any  a e "’co,  n\  (a)  has  the 
reduction  property. 

Proof  (Blackwell  [67]).  Suppose  that  A and  B are  n\{a),  taking  A,  B C "’co 
for  simplicity.  Let  T and  U be  trees  on  co  x a>  as  provided  by  the  basic  13.1 
representation  of  Il|  sets  so  that  for  any  z e "’co, 

A(z)  **  Tz  is  well-founded  , and 
B(z)  **  Uz  is  well-founded  . 

For  z e "’co,  consider  a game  Gz  played  with  integers: 

I : x(0)  x(2) 

II:  x(l)  x(3) 

I wins  exactly  when  there  is  an  i e co  such  that  xfi  e Uz  and  xn\i  £ 77,  i.e.  I 
manages  to  stay  on  the  tree  Uz  until  II  falls  off  of  77  and  confirms  membership 
for  z in  A. 

Set 

A*  = [z  e A | 77  has  no  winning  strategy  in  Gz)  , and 
B*  — [z  e B | 7 has  no  winning  strategy  in  G~]  . 

If  z e A — B,  then  z e A*:  Letting  y e [LL],  no  play  where  7’s  moves  are 
x(2 i)  = y(i ) for  i e co  can  be  a win  for  77.  Analogously,  if  z € B — A,  then 
z e B*.  Finally,  if  z e A n B,  then  z e A*  U B*  as  7 and  II  cannot  both  have 
winning  strategies  in  Gz.  This  case-by-case  analysis  shows  that  AUB  = A*  LIB*. 

Next,  since  in  each  Gz  any  win  for  7 is  secured  at  a finite  stage,  by  27.1 
(cf.  the  remarks  afterward)  Gz  is  determined.  Hence,  A*  n B*  = 0. 

Finally,  the  13.1  representation  implies  that  for  any  z e "’co, 

A*(z.)  **  A(z)  A Vr(r  is  a strategy  for  //—»■...) 
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where  ...  is  arithmetical,  so  that  A*  is  Tlj  (a).  B*  is  similarly  77}  (a),  and  so  the 
proof  is  complete.  H 

This  is  a singular  example  of  a new  proof  of  an  old  result  that  stimulates  a re- 
generation of  the  ambient  theory.  Addison  and  Martin  quickly  and  independently 
came  to  the  idea  of  assuming  Projective  Determinacy  (PD)  to  derive  structural 
conclusions.  With  E-}  having  the  reduction  property  also  a classical  result  (see  af- 
ter 29.9)  they  each  saw  how  to  generalize  Blackwell’s  proof  to  show  that  Det(A') 
implies  that  11}  has  the  reduction  property.  Then  Martin  and  Moschovakis  inde- 
pendently propagated  the  reduction  property  through  the  projective  classes  under 
PD.  With  ZFC  securing  the  reduction  property  only  for  IlJ  and  as  it  turns  out, 
and  delimiting  it  to  at  most  one  of  r or  —<r  (29.4),  PD  is  a regularity  hypothesis 
that  provided  the  deductive  power  to  establish  reduction  for  at  least  one  of  r or 
—<r  for  every  r by  a natural  inductive  process. 

The  Prewellordering  Property 

Although  the  Blackwell  proof  can  be  generalized  to  propagate  reduction  through 
the  projective  hierarchy  under  PD,  attention  soon  shifted  to  a stronger  property 
that  could  be  similarly  propagated.  That  property  had  already  been  abstracted 
by  Moschovakis  in  1964  from  the  classical  ordinal  analysis  of  11}  and  sets, 
and  being  intrinsic  to  sets  rather  than  relating  two  sets  it  soon  assumed  a greater 
prominence.  The  formulation  proceeds  through  several  definitions: 

A relation  < on  some  A is  a prewellordering  (of  A)  iff  < is  reflexive  on  A, 
transitive,  connected  on  A (i.e.  Vx,  y e A(x  < y v y < x )),  and  well-founded. 
Thus,  such  a < satisfies  all  the  conditions  for  a well-ordering  of  A except  possibly 
anti-symmetry:  there  may  be  distinct  x,  y e A such  that  x < y and  y < x.  With  a 
< contextually  clear,  <,  >,  and  >-  have  the  expected  derived  meanings.  A norm 
on  A is  a function  p:  A ->  On.  Such  a p induces  a prewellordering  of  A 
defined  by: 

X <p  y iff  p(x)  < p(v)  - 

Conversely,  for  any  prewellordering  ■<  of  A there  is  a norm  p : A — * On  such  that 
<p  — <.  A canonical  such  norm  is  the  rank  function  of  < (cf.  0.3);  it  is  the  one 
norm  whose  range  is  an  ordinal. 

To  illustrate,  let  U be  an  a>\  -complete  ultrafilter  over  a set  S,  and  for 
/,  g:  S ->  On  define 

f <ug  iff  H e S | /(»)  < g(/)}  e U . 

Assuming  DC,  the  lack  of  infinite  descending  chains  implies  well-foundedness, 
so  <u  is  a prewellordering.  The  corresponding  rank  function  is  just  that  function 
taking  / to  its  equivalence  class  \ f]u  in  the  (transitive  collapse  of  the)  ultrapower. 

Definability  is  incorporated  next.  For  A c k(wto),  p:  A — >•  On  is  a V -norm 
iff  there  are  R+  in  r and  R in  —>r  with  R+,  R c 2/:(m co ) such  that  for  any 
ye  A, 
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xeA  A p(x)  < p(y)  R+(x,y)  R (x,y), 

i.e.  {x  e A | p(x)  < p(y)}  is  in  Ar  “uniformly  in  y”.  For  A in  r,  this  is 
stronger  than  requiring  that  the  induced  prewellordering  <p  be  in  r,  but  weaker 
than  requiring  that  <p  be  in  Ar.  Finally, 

r has  the  prewellordering  property  iff  every  A in  r has  a /’-norm  . 

This  definition  serves  for  classes  more  general  than  the  projective  classes.  For 
the  latter  only  a representative  set  need  be  considered:  r has  the  prewellordering 
property  iff  a set  A c 2(coa>)  universal  for  r has  a r -norm.  (For  1 < k e a>  any 
subset  of k (“«)  can  be  homeomorphically  identified  with  a subset  of  “'ey,  which 
can  then  be  endowed  with  a /’-norm  derived  from  one  for  the  universal  set.) 

A useful,  if  less  motivated,  characterization  of  /’-norm  casts  a different  light 
on  Moschovakis’s  property: 

29.6  Exercise.  Suppose  that  A is  in  r and  p:  A ->  On.  Then  p is  a T -norm  iff 
the  following  relations  are  in  /’: 

x <*  y xeA  A (y  e A ->  p(x)  < p( y))  . 

x<*y  xeA  A (y  e A ->  p(x)  < p( y»  . 


Hint.  Note  first  that  if  R+ 

and  R are  as  in  the  definition  of  /’-norm,  then 

♦ 

x<Py  ^ 

xeA  A (R+(x,  y)  v -^“(y,  x))  , and 

x <*  y 

x e A A -iR-(y,  x)  . 

For  the  converse,  set 

R+  (x,  y)  x <*  y , and 

R-(x,y)  -'(y  <*  x)  . 

H 

For  /’-norms  P>  <*P  and  <*  are  taken  as  basic.  They  are  just  what  is  needed 
for  a simple  reduction  process: 

29.7  Proposition  (Kuratowski  [36],  Addison  [58]).  If  T has  the  prewellordering 
property,  then  T has  the  reduction  property. 

Proof.  Suppose  that  A,  B C k(‘°a> ) are  both  in  T.  With  the  constant  functions 
6:  <x>  —>  [0]  and  i:  to-*  [1],  set 

R — (A  x [0])U(B  x [1])  . 

R is  in  r,  so  let  p:  R — > On  be  a /’-norm.  The  idea  is  to  see  to  which  set  A or 
B membership  is  first  secured  according  to  the  norm.  Define  A* , B*  C k("Ja>)  by 
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x e A*  (x,0>  <*  (x,  1)  , and 

x e B*  (x,  1)  <*  (x,  0>  . 

By  the  exercise  (A*,  B *)  reduces  (A,  B)  and  A*  and  B*  are  in  F . H 

The  following  is  a simple  consequence  of  the  basic  13.1  representation  of 
nj  sets,  and  indeed  harkens  back  to  the  original  ordinal  analysis  of  IlJ  sets  in 
Luzin- Sierpinski  [18]. 

29.8  Exercise.  For  any  a e wa>,  n\(a)  has  the  prewellordering  property. 

Flint.  Suppose  that  A C kf°w)  is  Tl\(a),  with  T a tree  on  kco  x co  such  that 
A(w)  **  7„  is  well-founded  . 


Define  p:  A ->  On  by 

P(w)  = ||7W||  , 

the  height  of  the  well-founded  tree  T„.  That  this  is  a Fl\  (fl)-norm  follows  from 
the  arguments  for  the  relations  R-  and  R in  the  proof  of  13.17:  Fixing  the 
parameter  x there,  take  y <*  z to  correspond  to  R-(0,  x,  y,  z)  and  y <*  z to 
~‘R<(0,  x,  z,  y).  H 

29.7  now  implies  29.5,  and  this  is  essentially  the  classical  proof  of  that  result. 
The  early  descriptive  set  theorists  did  not  get  to  a useful  representation  of  E, 
sets  (cf.  13.14),  but  Novikov  [35]  did  get  results  about  them  using  an  ordinal  anal- 
ysis lifted  from  the  IT}  sets.  Moschovakis  formulated  that  lifting  schematically: 

29.9  Proposition  (Novikov  [35],  Moschovakis).  Suppose  that  C r and 
r has  the  prewellordering  property.  Then  so  does  3lT.  Consequently,  for  any 
a e “a),  S)  (a)  has  the  prewellordering  property. 

Proof.  Suppose  that 

A(x)  **  3 lwB(x,  w) 

where  B is  in  F . Let  v be  a /’-norm  on  B,  and  define  p:  A — > On  by: 

p(x)  = min({v(x,  w)  \ B(x,  w )})  . 


Then  in  terms  of  29.6, 

x<*y  31wV1j((x,  w)  <*  (y,  z))  , and 
x <*  y 31wV1z((x,w)<*(y,z)), 

and  since  V'T  C T,  these  relations  are  in  31/’.  H 

This  implies  the  result  of  Kuratowski  [36]  that  has  the  reduction  property, 
and  so  the  results  of  Novikov  [35]  that  IT}  has  the  separation  property  but  H} 
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does  not.  Note  the  importance  of  the  well-foundedness  of  the  prewellordering  in 
the  proof;  well-foundedness  was  not  needed  in  29.7,  but  is  a crucial  feature  in  the 
inductive  propagation  of  the  prewellordering  property. 

What  happens  at  the  higher  levels?  L not  only  delimited  the  classical  efforts 
but  also  provided  the  first  analysis  consistent  with  ZFC.  We  again  start  with  an 
abstract  definition  for  a general  statement: 

Assuming  that  -<  is  a (strict)  well-ordering  of  in  ordertype  w\ , let  IS^  c 
2(u’co)  be  given  by 

IS^(x,  y ) {(x);  | i e co}  — [z  e \ z < y]  ■ 

Note  that  if  -<  is  in  r,  then  it  is  also  in  —>r  as  x < y **  x f y A —>(y  -<  x), 
and  so 


IS^(x,  y)  V°!'((x)i  -<  y)  A V*z(z  -<  y -a  3 °i(z  = (*);)) 

shows  that  IS^  is  in  V1/"  flV1-'/".  The  following  imposes  a stronger  restriction 
on  IS^: 


-<  is  r -good  iff  1S^  is  in  Ar  . 

It  is  simple  to  see  that  if  -<  is  r - good,  then  it  is  in  F,  and  that  -<  is  r- good  iff 
-<  is  -'/"-good. 

29.10  Proposition.  Suppose  that  —>r  C 3 1 F and  there  is  a 3 1 F-good  well- 
ordering of  01  co.  Then  3 1 F has  the  prewellordering  property. 

Proof.  Suppose  that  A is  31/",  where  for  simplicity  A c wco,  say 

A(x ) **  3 lwB{x,w) 

where  B is  in  F.  Let  -<  be  a F- good  well-ordering  of  01  co,  -<2  the  induced 
lexicographic  well-ordering  of  2(“tt>),  and  v:  2(wco)  -»  On  the  corresponding  rank 
function.  For  each  x e A,  let  wx  by  the  -<-least  w so  that  B(x,  w),  and  define 
p:  A — >•  On  by 

p(x)  = v(wx,x)  . 

(p(x)  could  have  been  taken  to  be  just  the  ^-rank  of  wx,  but  for  a later  purpose 
an  injective  norm  is  produced.)  Then 

x <*  y **  3 1w(B(x,w)  A 31m(IS^(m,  w)  a V°i-<B(y,  («),•)))  V 
^lw[B{x,w)  A B(y,w)  A 

3'm(1S^(m,  w)  A V°/(-'B(x,  (u)j)  A —•B(y,  (w), ))  A x -<  y]  , 


and  similarly  for  <*.  Since  —>r  c 31/",  these  relations  are  thus  seen  to  be  in 
31  /’,  and  so  the  result  follows.  H 
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29.11  Corollary.  If  01  oj  C L,  then  for  any  a e 10  co  and  2 < n e co,  Ejfa)  has  the 
prewellordering  property. 

Proof.  By  13.11,  2(“&>)  fl  <L  is  a IT,1 -good  well-ordering  of  “w.  H 

It  was  to  become  clear  from  Silver  [71b:  440]  that  the  same  conclusion  can 
be  drawn  if  the  reals  are  included  in  an  inner  model  L[U]  of  measurability;  by 
20.17  the  reals  in  these  models  coincide. 

29.12  Corollary.  If  co  C L[U\,  then  for  any  a e “ co  and  2 < n e co,  E\  (a)  has 
the  prewellordering  property. 

Proof.  The  result  holds  in  any  case  for  n — 2 by  29.9,  and  for  n > 2 there  is  by 
20.20  a E\  -good  well-ordering  of  "&>.  H 

Thus,  if  V — L or  even  V — L[U],  then  the  reduction  property  holds  for 
II]  and  then  for  X1  for  every  n > 2.  This  was  first  seen  for  L by  Addison  [59]; 
Novikov  [51]  had  announced  the  first  result  of  this  kind,  that  if  V — L,  then  for 
sufficiently  large  n the  separation  property  holds  for  II,' . 

This  persistence  of  reduction  on  the  X side  depended  on  the  external  circum- 
stance of  having  a well-ordering  of  01  co  with  nice  definability  properties.  Addi- 
son [74:9-10]  speculated  in  1967  on  other  possibilities,  and  his  remarks  “turned 
out  to  be  prophetic”,  for  soon  afterwards  a more  natural  pattern  for  reduction 
through  the  projective  hierarchy  was  established  under  PD,  and  even  more  to  the 
point,  the  argument  featured  a natural  inductive  propagation.  It  was  for  this  pur- 
pose that  Martin  introduced  his  filter  over  Turing  degrees.  In  another  approach 
Moschovakis  explicitly  propagated  the  prewellordering  property  using  strategies 
for  specific  games,  establishing  what  he  later  called  the  First  Periodicity  Theorem: 

29.13  Theorem  (DC)(Martin  [68];  Moschovakis  - Addison-Moschovakis  [68]). 
Assume  Det{  A p).  Suppose  that  31  T c r and  r has  the  prewellordering  property. 
Then  so  does  V1  T. 

Both  the  Martin  and  Moschovakis  arguments  will  be  given,  but  first  the  con- 
sequence of  this  result  and  29.9  that  PD  inductively  propagates  the  prewellordering 
property  in  a “zig-zag”  pattern: 

29.14  Corollary  (DC).  Assume  PD.  Then  for  any  a e c°co,  the  following  classes 
have  the  prewellordering  (and  hence  the  reduction)  property: 

Z\(a)  El(a) 


TI{  (a)  17 j (a) 

A 


In  contrast  to  the  stark  situation  in  L and  in  L[U\,  this  perpetuation  of  the 
alternation  beyond  the  classical  “zig”  from  IT}  to  Xi  of  the  reduction  property 
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suggested  that  PD  may  be  a hypothesis  with  the  deductive  power  to  settle  major 
questions  about  the  projective  sets  in  ways  that  naturally  extend  the  classical 
theory. 

The  following  is  Martin’s  argument  with  Turing  degrees,  cast  in  terms  of 
prewellorderings.  The  idea  of  ranking  functions  via  ultrapowers,  so  crucial  in 
later  arguments  in  descriptive  set  theory,  first  appeared  here  (cf.  after  15.4). 

First  Proof  of  29.13.  Suppose  for  simplicity  that  Ac®®  and 

A(x)  **  VlwB(x,w) 

where  B is  in  r.  Let  v be  a /’-norm  on  B.  Recalling  the  terminology  toward 
28.4,  for  x e A define  V j -»  On  by: 

sx(d)  — sup({v(x,  w)  + 1 | [w]T  < d})  , 

and  for  y e A also,  set 


Zxy  = {d  eV x | sx(d)  < sy(d)}  . 

Note  first  that  for  a , x,  y G 

s*([>]t)  < ^(Mt)  Vw  <T  a3z  <t  a(v(x,  w)  < v(y,  z))  ■ 

It  is  well-known  that  the  displayed  quantifiers  are  arithmetical,  about  number  codes 
for  “reduction  procedures”  (with  the  codes  being  for  formulas  and  finite  sequences 
if  we  proceed  in  terms  of  12.4).  Consequently,  it  follows  from  v being  a /’-norm 
and  the  definitions  that 

{r7  G w (d  | [u]']'  G Zxvj 

is  in  Ap-  Hence,  with  Det(zir)  Martin’s  argument  for  28.4  implies  that  for  any 
x,  y e A, 

either  Zxy  g Mj  or  else  Vj  — Zxy  e MT  . 

Define  an  ordering  < of  A by: 

x < y iff  Zxy  g Mt  . 

< is  a prewellordering,  DC  and  the  o>\ -completeness  of  MT  ensuring  well- 
foundedness,  so  let  p:  A ->  On  be  the  corresponding  rank  function,  (p  ranks 
the  members  x of  A as  per  the  corresponding  sv’s  in  an  ultrapower  of  On  by  Mj.) 
To  complete  the  proof,  we  show  that  p is  a V1  /’-norm: 

First,  for  y G A, 

x G A A p(x)  < p(y)  **  B1aV1b(a  <j  b — > 

Vw  <T  b3z  <t  b[B(x,  w ) A v(x,  w)  < v(y,  z)])  . 

(Note  that  for  any  w,  a e mco  there  is  always  a b e wco  such  that  w <t  b and 
a <t  b,  and  so  x G A is  indeed  a consequence  of  the  right  side.)  The  quantifiers 


412 


Chapter  6.  Determinacy 


involving  <T  are  arithmetical  as  observed  before,  and  the  relation  [. . .]  is  in  ->F 
as  v is  a G-norm.  Consequently,  the  overall  relation  is  in  31V1-'.r,  or  equivalently 
in  -•V131r,  and  hence  in  -A/1/"  as  31  r C r . 

We  also  have  for  y e A 

x e A A p(x)  < p(y)  **  x e A A [d  eV T | sy(d)  < sx(d)}  £ Mj 
**  x e A A 'ixa3[b(a  <j  b A 

3u;  <T  b'iz  <t  b[v(y,  z)  < v(x,  w)])  . 

Since  for  y e A the  relation  [. . .]  is  in  F as  v is  a /’-norm,  the  overall  relation 
is  in  and  hence  in  V1/"  as  31  F C F . The  definition  of  V^-norm  thus 

satisfied,  the  proof  is  complete.  H 

Just  as  the  classical  argument  for  29.9  lifts  the  prewellordering  property  from 
Il‘  to  H'!+1  by  the  simple  expedient  of  taking  minima  to  define  a new  norm, 
Martin  had  found  a way  to  “integrate”  over  suprema  to  lift  the  prewellordering 
property  from  L*  to  TT('I+ , . Moschovakis  was  able  to  take  suprema  more  concretely 
with  specific  strategies,  with  an  argument  that  was  to  have  an  important  extension 
(30.8): 

Second  Proof  of  29.13.  Suppose  again  that  A c wa>  and 

A(x)  **  V'wBlx,  w) 


where  B is  in  r . 

For  x,  y e consider  the  game  G(x,  y)  played  with  integers: 

/ : r(0)  t( 2) 

II:  t{  1)  t(  3)  ... 

II  wins  if  either:  (a)  -■ B(y,tn ),  or  else  (b)  B(x,ti ) and  v{x,ti)  < v(y,tu). 
This  game  is  determined:  If  y £ A,  then  II  can  always  win  by  ensuring  (a).  If 
y e A,  then  (a)  never  occurs,  and  the  definition  of  /"-norm  ensures  that  the  payoff 
set  according  to  (b)  is  in  Ar,  and  Det(Z\r)  is  being  assumed. 

Define  ■<  on  A by: 

x < y II  has  a winning  strategy  in  G(x,  y ) . 

Thus,  x < y exactly  when  sup({v(x,  w)  \ w e wa>})  < sup({u(y,z)  | z e u’co}) 
uniformly,  in  the  sense  that  there  is  a winning  strategy  r for  II  so  that  for  any 
w e “&>,  v(x,  w)  < v(y,  (w  * r)//). 

That  a is  a prewellordering  of  A is  checked  clause  by  clause: 

< is  reflexive:  For  x e A,  1 1 has  a simple  winning  strategy  in  G(x,  x);  just 
copy  /’ s moves. 

< is  transitive:  Suppose  that  xj,ze  A,  x < y,  and  y < z..  Letting  n be  a 
winning  strategy  for  II  in  G(x,  y)  and  t2  winning  for  II  in  G(y,  z),  if  a strategy 
r for  II  could  be  devised  so  that  for  any  w e “co. 
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(w  * r )//  = ((w  * Ti )//  * r2)// 


with  the  consequent 

v(x,  w)  < v{y , (u>  * ti)//)  < u(z,  (w  * r)//)  , 

then  r would  be  winning  for  II  in  G(x,  z)  and  x < z . But  this  is  simple  to  do: 

For  the  initial  response  set  r ((/))  = r2  (( Ti  (<?)))) . Generally,  II  keeps  partial 
plays  of  G(x,  y ) and  G(y,z ) going,  with  each  of  /’s  moves  in  G(x,  z)  viewed  as 
a move  in  G(x , y),  considering  the  response  according  to  x\  as  /’  s next  move  in 
G(y,  z),  and  then  responding  to  it  according  to  r2. 

< is  connected'.  Suppose  that  x,  y e Z and  —>(x  <y).  By  determinacy,  I then 
has  a winning  strategy  a in  G(x,  y),  and  since  x,  y e A,  v(x,  ( o*w)i ) > v(y,  w ) 
for  any  w e <°co.  If  a strategy  r for  II  could  be  devised  in  G(y,x)  so  that  for 
any  w e 01  co, 

(er  * w)i  — ( w * r )// 

with  the  consequent 

v(y,  w ) < v(x,  (ct  * w)i ) = v(x,  (w  * r )// ) , 

then  r would  be  winning  for  II  in  G(y,  x)  and  y < x.  But  this  is  simple  to  do: 
Set  r ((/'})  = er(0),  and  generally 

r((s(0),  . . . , s(2n  + 2)»  = <r((s(l),  5(0),  5(3),  5(2), . . . , 5(2n  + 1),  5(2b)»  . 

At  each  turn  II  temporarily  ignores  /’  s previous  move  s(2n  + 2),  transposing 
the  consecutive  odd  and  even  values  of  the  partial  play  until  then,  and  responds 
according  to  a. 

< is  well-founded:  With  DC  in  hand,  assume  to  the  contrary  that  xo  >-  x\  >- 
x2  ...  are  all  in  A,  where  with  connectedness  x >-  y iff  ->(x  < y).  For  each 
i e co,  let  <t,-  be  a winning  strategy  for  I in  G(x,,  xI+i),  so  that  for  any  w e coa>, 
v(x,-,  ((Ti  * w)i)  > v(Xj+i,  w).  Simultaneously  produce  plays  tj  for  G(x,-,  x(+i)  for 
every  i e co, where  I follows  er,-  in  G(x;,X;+i),  and  7/’s  nth  move  in  G(x,-,x,+i) 
(the  (2 n + l)st  move  of  the  game)  is  /’ s nth  move  in  G(xi+i , x,+2)  (the  2nth 
move  of  that  game).  We  then  have  (G ) //  = (b+i)/  for  every  i e co,  arriving  at 
the  contradiction 

v(x0,  (to)/)  > v(xi,  (t0)//)  = v(xi,  (fl)/)  > v(x2,  (fl)//)  = v(x2,  (t2)/)  > . . . . 

It  remains  to  verify  that  the  norm  p corresponding  to  < is  a V'C-norm. 
Suppose  that  y e A.  Then  if  II  has  a winning  strategy  in  G(x,  y),  it  follows  that 
x e A.  (Otherwise,  for  any  w such  that  ~^B(x,  w),  w * x would  be  a play  that 
loses  for  II  according  to  the  rules.)  Hence, 

x e A A p(x)  < p(y)  1 1 has  a winning  strategy  in  G(x,  y) 

**  ErV'wIr  is  a strategy  for  II  A 

[B(x,  w)  A v(x,  w)  < v(y,  (w  * r )//)])  . 
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Strategies  are  coded  by  reals  and  the  relation  [. . . ] is  in  ->r  as  v is  a /"-norm. 
Consequently,  the  overall  relation  is  in  i'V1-'/",  or  equivalently  in  -A/'i1/",  and 
hence  in  -A/1/"  as  31/"  c r. 

Also, 

x e A A p(x)  < p(y)  I has  no  winning  strategy  in  G(x,  y ) 

**  Va3lz(cr  is  a strategy  for  I — >■ 

[B(x,  (a  * z)i)  A v(x,  (a  * z)i)  < v{y,  z)])  . 

Since  the  relation  [. . . ] is  in  r as  v is  a C-norm,  the  relation  is  in  V'31/"  and 
hence  in  V1/"  as  3 1 F C r . The  definition  of  V1  /’-norm  thus  satisfied,  the  proof 
is  complete.  H 

This  section  is  brought  to  a close  with  a discussion  of  the  structural  properties 
thus  far  considered  for  more  general  classes  under  full  AD.  These  results  are  part 
of  a general  theory,  one  underpinned  by  a basic  lemma  established  by  William 
Wadge  in  1968  toward  his  Berkeley  dissertation  [83].  For  A,  B c®®  define 

A <w  B iff  A — f~1(B)  for  some  continuous  /: 

i.e.  A is  a continuous  pre-image  of  B.  The  following  is  Wadge ’s  Lemma: 

29.15  Lemma  (Wadge  [72]).  Assume  AD.  Suppose  that  A,  B c '"o>.  Then  either 
A <w  B or  else  B <w  “a»  — A. 

Proof.  Consider  the  Wadge  game  WG(A,  B ) played  with  integer  moves: 

I : x(0)  x(2) 

II  : x{\)  x(3) 

II  wins  exactly  when  X/  € A iff  Xu  € B . If  r is  a winning  strategy  for  II,  then 
z e A iff  (z*  r)n  e B for  any  z e coa>,  and  as  the  map  sending  z to  (z  * r )//  is 
clearly  continuous,  A <w  B.  On  the  other  hand,  if  a is  a winning  strategy  for  I, 
then  (o  * y)i  ^ A iff  y e B for  any  y e 01  oj,  and  so  similarly  B <w  mco  — A.  H 

This  simple  but  revelatory  lemma  established  a basic  connectedness  under 
AD  between  arbitrary  subsets  of  “®.  For  boldface  r and  just  with  a local  r 
version  of  the  lemma,  note  that  any  B C 01  co  in  r — —•T  is  complete  in  the  sense 
that  for  any  A c wco  in  T,  A <w  B.  (The  alternative  B <w  10 co  — A would  imply 
by  the  simple  definability  of  continuous  functions  that  B is  in  ->/".) 

For  any  A c U 

A is  continuously  closed  iff  A is  closed  under  continuous  pre-images 

i.e.  if  B e A and  A <w  B,  then  A e A.  Extending  the  terminology  for  projective 
classes  to  arbitrary  A c {_]k€(0V{k{u>co))  to  get  e.g.  ->A  with  the  expected  meaning, 


A is  nonselfdual  iff  —~A  f A . 
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Next,  for  A,  B C c°co  set 

A =w  B iff  A <w  B A B <w  A , and 
[A] w = {C  c “a,  | C =w  A}  . 

The  equivalence  classes  [A]w  are  known  as  the  Wadge  degrees.  Wadge  in  his 
dissertation  [83]  latterly  appearing  carried  out  a detailed  study  of  Wadge  degrees  of 
Borel  sets  assuming  the  corresponding  determinacy,  one  that  revealed  in  particular 
that  <w  on  these  sets  is  well-founded.  Leonard  Monk  found  a short  argument 
directly  relating  winning  strategies,  and  applying  this  with  AD  Martin  established 
a basic  structure  result  in  1973: 

29.16  Theorem  (DC)(Martin  and  Monk).  Assume  AD.  Then  <w  is  a well-founded 
relation. 

Van  Wesep  [78a]  exposed  the  basic  role  of  Wadge  degrees  and  provided  a 
proof  of  this  result.  It  follows  from  the  hypotheses  and  Wadge’s  Lemma  that 
if  each  Wadge  degree  is  coalesced  with  its  dual  class,  then  <w  well-orders  the 
resulting  classes:  Each  rank  of  <w  is  exactly  one  Wadge  degree,  or  else  the  union 
of  a Wadge  degree  and  its  dual  class  when  they  are  nonselfdual.  Since  each  of  the 
classes  2°,  11°,  L*,  and  £*  is  readily  seen  by  induction  to  be  continuously  closed, 
their  restrictions  to  Vlfco)  are  initial  segments  of  <w-  Determinacy  provides  an 
ultimately  fine  stratification  of  VC°co)  via  continuous  reducibility. 

Against  this  backdrop  the  following  results  established  under  AD  articulate 
the  pervasiveness  of  the  separation  and  reduction  properties  under  that  hypothesis: 
Van  Wesep  [78]  initially  observed  that  if  A is  continuously  closed  and  nonselfdual, 
then  at  most  one  of  A and  -<A  can  have  the  separation  property.  Steel  [81]  then 
showed  that  for  such  A at  least  one  of  A and  -•A  does  have  the  separation  property. 
The  arguments  for  29.2  and  29.3  relate  the  reduction  and  separation  properties,  so 
that  if  there  is  a set  universal  for  A,  then  at  most  one  of  A and  ->A  can  have  the 
reduction  property.  If  A is  continuously  closed  and  nonselfdual,  then  it  turns  out 
that  there  is  a set  universal  for  A;  however,  Van  Wesep  [78a:  163]  observed  that 
there  are  such  A so  that  neither  A nor  —•A  has  the  reduction  property.  Improving  a 
previous  result  Steel  [81]  then  showed  that  if  A is  continuously  closed,  nonselfdual 
and  closed  under  the  taking  of  finite  unions,  then  at  least  one  of  A and  —>A  does 
have  the  reduction  property. 

Further  results  about  general  classes  were  soon  established  in  ZF  + DC  + AD: 
Kechris-Solovay-Steel  [81]  explored  the  extent  of  the  prewellordering  property 
and  showed  for  example  that  if  A C L(R)  is  nonselfdual  and  closed  under  real 
quantification,  then  either  A or  -'A  has  the  prewellordering  property.  Steel  [81a] 
established  closure  properties  of  nonselfdual  A based  on  those  for  Aa.  Becker  [88] 
established  closure  properties  of  various  A under  category  and  measure  quantifiers. 

As  for  finer  work  in  the  ZF  + DC  context,  Borel  Wadge  Determinacy  is 
the  proposition  that  for  every  pair  of  Borel  sets  A,  B C "’a>  the  Wadge  game 
WG(  A,  B)  (as  in  the  proof  of  Wadge’s  Lemma)  is  determined.  Assuming  only 
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this  Alain  Louveau  [83]  described  the  Wadge  ordering  of  Borel  sets,  recasting 
work  from  Wadge’s  dissertation  [83].  Then  Louveau  and  Jean  Saint-Raymond 
(see  their  [87,88])  showed  rather  unexpectedly  that  Borel  Wadge  Determinacy  is 
already  provable  in  second-order  arithmetic. 
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30.  Scales,  Projective  Ordinals,  and  L(R) 

This  section  continues  the  exploration  began  in  §29  of  the  structural  consequences 
of  determinacy  hypotheses  in  descriptive  set  theory.  The  references  and  conven- 
tions are  as  for  §29,  and  as  there, 

ZF  + AC w (“co)  serves  as  the  ambient  theory  for  this  section  . 

But  what  serves  to  distinguish  is  the  increased  complexity,  and  the  dispensing  of 
several  proofs  in  favor  of  a brisk  account  of  the  main  developments.  At  the  end 
is  a survey  of  recent  results,  notably  those  about  the  structure  of  L(R). 

A question  left  unanswered  by  the  prewellordering  theory  of  the  previous 
section  is:  What  about  uniformization?  Continuing  in  our  general  setting, 

r has  the  uniformization  property  iff 

every  subset  of  2(“o>)  in  r can  be  uniformized  by  a set  in  7\ 

Thus,  II]  has  the  uniformization  property.  Using  a coded  union  as  in  the  argument 
for  29.7,  it  is  simple  to  see  that  if  r has  the  uniformization  property,  then  r has 
the  reduction  property. 

Martin  [68:  689]  conjectured  that  under  PD  the  uniformization  property  would 
hold  with  the  same  zig-zag  pattern  as  for  the  reduction  property.  Analyzing 
the  uniformization  arguments  of  Kondo  (13.17)  and  Martin- Solovay  (15.14), 
Moschovakis  in  1971  isolated  a strengthening  of  the  prewellordering  property 
appropriate  for  propagating  uniformization.  Although  somewhat  technical,  this 
new  property  soon  assumed  a central  role,  for  it  abstracted  a substantial  aspect  of 
determinacy  for  various  classes  of  sets  of  reals.  The  formulation  proceeds  through 
several  definitions: 

For  A c k(0>co)  and  ordinal  y,  a scale  is  a sequence  (pn  \ new)  of  norms: 
A -*  y satisfying  the  following  property:  Suppose  that 

[x,-  | i e w}  c A with  lim^^x,  = x , 

and  there  is  a g:  w ->  y such  that 

for  every  new  and  sufficiently  large  i e w,  p„(x; ) = g{ri)  . 


Then 

x e A,  and  for  every  n e w,  p„(x)  < g(n)  . 

Here,  lim,^w  is  in  the  usual  sense  of  convergence  with  respect  to  the  topology  of 
* (“'&>),  so  that  topological  convergence  through  A together  with  a norm  conver- 
gence is  to  imply  membership  in  A as  well  as  a lower  semicontinuity  with  respect 
to  the  norms.  Phrases  like  scale  on  A into  y will  make  explicit  the  parameters 
involved. 

Definability  is  incorporated  as  for  the  prewellordering  property:  First,  for 
new  let  h be  the  constant  function:  w — > {«}.  Then  a scale  (p„  | n e w)  on  A 
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is  a r -scale  iff  there  are  R+  in  r and  R in  —>r  with  R+,  R C 2k+l(ww)  so 
that  for  any  y e A and  n e &>, 

xeA  A p„(x)  < p„(y)  R+(h,x,  y)  **  R~(h,x,  y)  . 

It  is  simple  to  check  that  the  analogue  of  29.6  holds: 

30.1  Exercise.  Suppose  that  A is  in  r and  (p„  \ n e co)  is  a scale  on  A.  Then  it 
is  a r -scale  iff  the  following  relations  are  in  F : 

S<(n,x,  y)  x <*pn  y 

S<  (n,  x,  y)  x <*__  y H 


Finally, 

r has  the  scale  property  iff  every  set  in  r has  a r- scale  . 

The  following  useful  characterization  serves  to  motivate  the  concept  of  scale. 
In  the  terminology  of  § 13,  for  a tree  T on  y an  honest  leftmost  branch  is  an  / e [7] 
such  that  for  any  g e [T],  f(i)  < g(i)  for  eveiy  i e co.  Such  branches  occurred  in 
the  proof  of  II j Uniformization  13.17  and  were  emphasized  in  Kechris  [81];  trees 
T with  [T]  / 0 always  have  lexicographically  least,  or  leftmost,  branches,  but 
not  necessarily  honest  leftmost  branches.  For  a tree  T on  kco  x y,  T has  honest 
leftmost  branches  iff  for  any  x e p[T],  Tx  has  an  honest  leftmost  branch. 

30.2  Proposition.  For  any  A C k(°>a>),  there  is  a scale  on  A into  y iff  there  is  a 
tree  T on  kco  x y having  honest  leftmost  branches  such  that  A — p\T\ 

Proof.  Taking  k = 1 for  simplicity,  suppose  first  that  (pn  \ n e co)  is  a scale  on  A 
into  y.  Set 

T = {(x\i,  (p0(x),  • • • , Pi- iW))  | x e A}  . 

Then  T is  a tree  on  kco  x y,  and  A c p[T],  In  fact,  A = p[T]:  Sup- 
pose that  for  some  x e wco,  g e [7]v],  so  that  for  some  {x,-  | i e co}  c A, 

{Xi\i,  (po(x;), Pi— i(x,)»  = (x|t,  g\i)  for  every  i e co.  Since  liny^x,  = x 

and  pn(xj)  = g(n)  for  n < i e co,  x e A by  the  definition  of  scale.  It  also  follows 
that  p„(x)  < g(n)  for  every  n e co,  and  hence  (p„(x)  | n e co)  is  an  honest 
leftmost  branch  of  [ Tx  ] . 

For  the  converse,  suppose  that  A = p[T]  for  a tree  T on  cox  y such  that  for 
each  x e A,  fx  is  an  honest  leftmost  branch  of  Tx.  For  each  new  define  a norm 
Pn-  A^  y by: 

Ph(x)  = fx(n)  . 

Suppose  now  that  {x,-  | i e w}  c A with  lim,^ffl  x,  = x and  g:  w — >■  y is  such 
that  for  every  n and  sufficiently  large  i,  p„(x/)  = g(n).  Then  it  is  simple  to  see 
that  (x,  g)  e [T].  Hence,  x e A and  for  every  n,  p„(x)  = fx(n)  < g(n).  H 
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Thus,  a /"-scale  on  A provides  a representation  of  A as  p[T ] for  some  T 
having  honest  leftmost  branches  and  satisfying  a /"  definability  condition.  How 
that  condition  is  related  to  uniformization  is  clarified  by  the  next  two  results, 
implicit  in  the  proof  of  n[  Uniformization. 

30.3  Exercise.  For  any  a e ®<w,  n\  {a)  has  the  scale  property. 

Proof.  Suppose  that  A c k(0>co)  is  IJ}(a),  with  T a tree  on  ka>  x co  such  that 
A( w)  rw  is  well-founded  . 

The  argument  for  13.14  provides  a corresponding  tree  T on  kco  x a>\  such  that 
A(w)  e “co,  (w,  g)  e [f]) 

where  g provides  an  order-preserving  map:  Tw  ->  a>,  verifying  the  well- 
foundedness  of  Tw.  As  described  in  the  proof  of  13.17,  T has  honest  leftmost 
branches  defined  when  A(w)  given  by: 

[ Prw( s„)  if  s„  e T„  , and 
gwW  = { 

l 0 otherwise  . 

It  follows  from  the  proof  of  30.2  that  (p„  \ n e co)  defined  by  p„(w)  = gw(n)  is 
a scale  on  A.  Finally,  that  it  is  a /711(a)-scale  follows  from  the  arguments  for  the 
relations  R-  and  R in  the  proof  of  13.17:  In  terms  of  30.1,  take  5<(i,  y,  z)  to 
correspond  to  R~(i,  x,  y,  z ) and  5<(i,  y,  z)  to  R<(i , x,  y,  z).  H 

30.4  Proposition  (Moschovakis  [71]).  //  V1  /"  C F and  F has  the  scale  property , 
then  r has  the  uniformization  property. 

Proof.  Suppose  that  A C 2(fa>)  is  in  r,  with  (pn  \ n e a>)  a /"-scale  on  A into 
y say,  and  corresponding  relations  R 1 e F and  R e —>r.  The  argument  for 
30.2  shows  how  the  scale  leads  directly  to  a tree  T on  2a>  x y having  honest 
leftmost  branches  such  that  A = p\T],  Imposing  the  definability  condition  the 
uniformization  can  be  carried  out  as  in  the  proof  of  13.17: 

Define  Ao  c 2^°a>)  by: 

Aq{x,  y)  A(x,y)  A V1zV°m[(y(m)  = z(m) 

A V°n  < m(R~(fi,  x,  y,  x,  z)  A R~(n,x,z,x,y )) 

—*■  ( y(m ) < z(m)  V (y(m)  = z(m)  A R+(m,  x,  y,  x,  z)))]  . 

This  is  in  /"  by  the  definability  of  R+  and  R . Moreover,  if  A(x,  y ) and  A(x,  z), 
then  R~(h,x,y,x,z ) A R~ (h,  x,  z,  x , y))  is  equivalent  to  pn(x,y)  = pn(x,z), 
and  R+(m,  x,  y,  x,  z)  to  pm(x,y ) < pm(x,z ).  By  how  the  tree  T was  defined, 
it  follows  that  for  any  x e 0Ja>,  through  honest  leftmost  branches  Ao(x,  y)  does 
indeed  pick  out  that  y of  the  leftmost  pair  (y,  g)  such  that  (x,  y,  g)  e T if  there 
is  one,  and  so  the  proof  is  complete.  H 
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Previous  arguments  lead  to  the  existence  of  scales  in  other  situations: 

30.5  Exercise.  Suppose  that  —>r  C and  there  is  a B1  T -good  well-ordering 
of  "ft).  Then  Bl  T has  the  scale  property.  In  particular,  if  ® a>  C L,  or  even 
wco  C L\U]for  an  inner  model  L\U ] of  measurability,  then  for  any  a e 10  co  and 
2 < n e co,  Si  (a)  has  the  scale  property. 

Hint.  Consider  the  proof  of  29.10.  The  norm  p provided  is  injective,  and  so  it  is 
simple  to  see  that  (p  \ n e a>)  is  a scale.  Furthermore,  it  is  a /"-scale  by  30.1. 
The  second  assertion  of  the  exercise  follows  as  for  29.11  and  29.12.  H 

Note  that  13.19  establishes  the  uniformization  properties  in  L more  directly 
than  through  30.4  and  30.5. 

For  the  next  result  our  definitions  are  temporarily  liberalized  to  allow  “Ajt  (a)- 
scales”  with  the  expected  meaning. 

30.6  Exercise  (Mansfield  [71],  Martin- Solovay  [69]).  Assume  that  Vo  e ":a>(an 
exists).  Then  for  any  b e 0)co,  every  Tl){b)  set  has  a A\{b)-scale  into  um. 

Hint.  Argue  as  for  30.3,  but  using  15.14,  particularly  the  parenthetical  comment 
at  the  end  of  its  proof.  H 

The  scale  property  propagates  in  a manner  similar  to  the  prewellordering 
property,  and  this  reinforced  the  zig-zag  pattern  under  determinacy.  The  following 
is  analogous  to  29.9: 

30.7  Exercise  (Moschovakis  [71]).  Suppose  that  V1/"  C r and  r has  the  scale 
property.  Then  so  does  3 1 r.  Hence,  for  any  a e wco,  S\  ( a ) has  the  scale  property. 

Hint.  Suppose  for  simplicity  that  A C “’co  and 

A(x)  **  BlwB{x,w) 

where  B is  in  r,  and  let  (vn  n e co)  be  a /"-scale  on  B.  Define  new  norms 
through  a minimization  process: 

For  convenience,  assume  by  30.4  that  whenever  A(x),  there  is  a unique  wx 
such  that  B(x,  vox).  For  each  n e co,  define  <n  on  mco  by 

x <„  y iff  (v0(x,  wx),  wx (0),  vfx,  wx),  wx(  1), . . . , v„(x,  wx),  wx(n )) 

<lex  (Vo(y,  Wy),  Wy( 0),  V\{y,  Wy ),  Wy  ( 1 ) , . . . , ^(j,  Wy ) , Wy  («  ) } 

where  <iex  is  the  lexicographic  ordering,  and  let  p„  be  the  corresponding  rank 
function.  Then  ( pn  \ n e co)  is  a scale  on  A: 

Suppose  that  [x,-  | i e co)  c A with  lim = x,  and  there  is  a g:  co  -> 
On  such  that  for  every  n and  sufficiently  large  i,  pn (x,- ) = g(n).  Then  it  is 
straightforward  to  see  that  lim,-^  wXj  = w for  some  w and  that  for  some  h:  co  —> 
On  read  off  from  g,  v„  (x; , uiXj ) = h(n)  for  every  n and  sufficiently  large  i. 
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Consequently,  B(x , w)  (and  so  w — wx),  and  v„(x , w)  < h(n ) for  every  n.  But 
then,  A(x)  and  for  every  n,  p„(x ) < g(n)  by  definition  of  the  ;<„’s. 

That  { pn  | n e co)  is  a El'C-scale  follows  from  30.1  and  the  argument  for 
30.4.  H 


Just  as  the  First  Periodicity  Theorem  29.13  had  complemented  the  process  of 
taking  minima  for  the  prewellordering  property,  Moschovakis  again  took  suprema 
to  establish  the  Second  Periodicity’  Theorem : 

30.8  Theorem  (DC)(Moschovakis  [71]).  Assume  Det(Ar).  Suppose  that  C 

r and  r has  the  scale  property.  Then  so  does  V1  T . H 

The  proof  proceeds  by  elaborating  Moschovakis’s  argument  for  29.13,  in- 
terlacing countably  many  of  the  games  G(x,  y)  used  there;  see  Moschovakis 
[80:  31  Iff]  for  details. 

30.9  Corollary  (DC).  Assume  PD.  Then  for  any  a e ® co , the  following  classes 
have  the  scale  (and  hence  the  uniformization)  property: 

£>)  £>) 


77/ (a)  Til  (a ) 

H 


Moschovakis  [73]  provided  a third  periodicity  theorem.  Proceeding  more 
generally,  Moschovakis  [80: 325ff]  gave  a new  explanation  for  the  periodicity  of 
the  prewellordering  property,  one  that  encompassed  both  29.9  and  First  Periodicity 
in  a single  norm  propagation  theorem:  For  a class  A satisfying  some  weak  closure 
conditions,  if  Det(A),  then  the  class  of  those  A C k(‘°co)  defined  by 

A(w)  **  {V%o30«iV0M230«3  . . .}B( w,  (no,  n\,  «2,  «3,  ■■■)), 

where  {V°»o30niV°n23«3 . . .}  is  the  game  quantifier  and  B C <:+1("a))  e A,  has 
the  prewellordering  property’.  The  game  quantifier  has  the  expected  interpretation, 
asserting  in  this  instance  that  [x  \ ~^B( w,  x)}  is  determined  with  a winning  strategy 
for  II. 

Moschovakis  [80:  335ff]  then  applied  the  ideas  to  scales  to  derive  in  ZF  + DC 
the  Third  Periodicity  Theorem,  which  showed  that  when  winning  strategies  exist, 
they  can  be  simply  defined.  Moschovakis  [73]  had  stated  the  main  consequence 
for  the  projective  classes:  If  n e co,  DefiE,,,).  A C 01  co  is  and  1 has  a 

winning  strategy’  in  Gm(A),  then  I has  a winning  strategy  which  (construed  as  a 
real)  is  A\n+l(a). 

From  these  periodicity  theorems  flowed  results  that  generalized  most  of  the 
structure  theory  for  II j and  Dj  t0  all  levels  of  the  projective  hierarchy,  often  with 
proofs  applicable  to  the  first  levels  and  arguably  simpler  than  the  original  ones 
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there.  The  elegant  Spector-Gandy  Theorem  was  so  generalized  by  Moschovakis 
[73]  (indeed,  Martin  had  conjectured  the  consequence  of  Third  Periodicity  stated 
above  for  this  purpose);  measure  and  category  extensively  analyzed  by  Kechris 
[73];  and  the  theory  of  largest  countable  sets  developed  by  Kechris-Moschovakis 
[72]  and  Kechris  [75], 

Expanding  on  this  last,  there  is  a TTj1  set  Cj  without  a perfect  subset  such 
that  any  77'  set  without  a perfect  subset  is  a subset  of  C\  (see  after  13.12),  and  a 
set  C2  with  analogous  properties  (see  after  14.9;  in  fact  C2  = C2  = mco  IT  L). 
Under  local  versions  of  27.9,  to  be  bereft  of  perfect  subsets  is  equivalent  to  being 
countable,  and  one  can  inquire  after  the  existence  of  a largest  countable  r set,  i.e. 
one  that  contains  every  other  countable  r set.  Results  in  the  cited  papers  show 
that  under  PD  those  r so  that  there  is  a largest  countable  /’  set  have  the  same 
zig-zag  pattern  as  for  the  prewellordering  and  scale  properties.  The  structure  of 
these  largest  countable  sets  is  analyzed  in  Kechris  [75],  Guaspari-Harrington  [76], 
and  Martin  [83], 

Moschovakis  and  Kechris  also  used  scales  in  the  early  1970’s  to  develop  a 
theory  of  “partially  playful  universes”.  Moschovakis  (see  Kechris-Moschovakis 
[78a:  §5])  showed  initially  that  for  2 < n e co,  Det(A|)  where  k is  the  greatest 
even  integer  less  than  n implies  in  ZF  + DC  that  there  is  a smallest  inner  model 
M"  of  ZFC  absolute  for  Xj)  relations  (as  described  before  13.15).  Fie  verified 
that  Mn  satisfies  GCFI,  has  a X'(I'+1-good  well-ordering  of  its  reals,  and  because  of 
the  absoluteness,  satisfies  Det(A,'_1)  if  V does.  This  in  particular  established  the 
relative  consistency  of  having  finitely  many  zig-zag’s  for  the  scale  property  and 
then  the  property  settling  on  the  £ side  thereafter.  Moschovakis  and  Kechris  then 
considered  for  1 < n e co  the  models  f.[C2„]  where  Ci,,  is  the  largest  countable 
S\n  set  as  described  in  the  previous  paragraph.  These  models  are  “higher  analogs” 
of  L since  C2  = “&>n  L so  that  L[Ct\  = L.  Moschovakis  observed  that  L[C2„]  is 
also  absolute  for  relations  and  so  satisfies  Det(A2n_| ) if  V does,  and  Kechris 
showed  that  in  fact  it  has  a X^-good  well-ordering,  and  so  does  not  satisfy 
Det(£2„_i)  (by  27.14  and  the  argument  for  13.10).  Martin  then  established  in 
1973  a somewhat  unexpected  result  that  complemented  this: 

30.10  Theorem  (DC)(Martin).  For  n € co,  Det(A2n)  implies  Det(£2n). 

Kechris- Solovay  [85: 196ff]  provided  a simple  proof  of  this,  and  Kechris- 
Woodin  [83: 1784]  drew  a stronger,  contextually  optimal  determinacy  conclusion. 

Becker  [78]  and  Moschovakis  [80:  8G]  provide  detailed  accounts  of  the  theory 
of  partially  playful  universes,  with  both  considering  models  L[72„+i]  naturally 
defined  from  a scale  and  the  latter  also  introducing  similarly  defined  models  //2„+i- 
Discussed  is  an  informative  result  from  Harrington-Kcchris  [81]  on  simulating 
ordinal  games  that  showed,  generalizing  previous  work  of  Kechris-Martin  [78], 
that  Det(A2n+2)  implies  that  wco  fi  L[72„+i]  = C2«+2,  the  largest  countable  X72'n+2 
set.  Moschovakis  [81]  provided  an  analogous  characterization  of  V{X)FH2n+\  for 
many  A’s;  for  such  X Becker  [80]  established  an  analogue  of  the  perfect  set  property 
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for  X c V(X)  (proved  independently  by  Ramez  Sami),  verified  ( 2X  = x+)H2'i+1 
in  some  cases,  and  extended  the  work  of  Kechris  [75];  and  Becker-Moschovakis 
[81]  studied  the  measurable  cardinals  in  these  models. 

The  T2n+i  above  is  in  fact  the  tree  corresponding  to  a n\n+x -scale  (cf.  30.2) 
on  a “complete”  n\n+x  set.  Answering  a long-standing  question  of  Moschovakis, 
Becker-Kechris  [84]  showed  that  L[T2n+\]  is  independent  of  the  choice  of  the 
complete  set  and  the  scale  on  that  set,  and  moreover  that  L[T2n+x\  = H2n+\.  This 
is  a central  result  that  unified  the  theory  of  partially  playful  universes  and  revealed 
a strong  canonicity  for  the  concept  of  scale. 

Recent  results  about  the  extent  of  scales  are  surveyed  toward  the  end  of  this 
section.  Before  then  a theory  of  lengths  of  prewellorderings  is  developed,  one  that 
was  buttressed  by  the  results  on  scales  through  their  tree  characterization  30.2. 

Projective  Ordinals 

Moschovakis’s  groundbreaking  work  was  also  to  lead  to  elegant  structural  analyses 
of  the  projective  sets,  generalizing  Suslin’s  fundamental  theorem  12.1  that  a set 
is  Aj  iff  it  is  Borel,  and  Sierpinski’s  result  13.7  that  every  set  is  a union  of 
Ki  Borel  sets.  K|  is  an  important  parameter  in  these  results,  and  for  the  higher 
projective  classes  new  ordinals  were  to  play  analogous  roles.  In  his  study  of 
prewellorderings  Moschovakis  [70]  had  introduced  the  following;  the  length  of  a 
well-founded  relation  is  that  ordinal  which  is  the  range  of  the  corresponding  rank 
function. 

8r  = sup({£  | § is  the  length  of  a prewellordering  C 2(wco)  in  Ar})  . 

In  particular,  for  1 < n e a>  the  projective  ordinals  are  defined  by 


These  ordinals  are  evidently  definability  analogues  of  0 discussed  at  the  end  of 
§28.  It  is  consistent  with  ZFC  that  they  are  at  most  co2,  as  0 could  be  co2,  but 
under  AD  things  turn  out  to  be  dramatically  different. 

Some  simple  observations  set  the  stage: 

30.11  Exercise. 

(a)  Suppose  that  A is  in  T and  there  is  a T -norm  p\  A ->  On  whose  range 
is  an  ordinal.  Then  p:  A ->  8p,  and  A — UM?  | f < <5r]  for  some  sets  all  in 
Ar- 

(b)  cf((5 r)  > a>,  and  S { = a>\. 

Hint.  For  (b),  note  that  the  union  of  countably  may  sets  in  T is  again  in  /’.  It 
remains  to  verify  that  5}  < a>\ : 

Suppose  that  ■<  c 2(coa>)  is  a Lj  prewellordering.  Set 

X = [jf  e | 3 1 yW°mW°n  (x ({m , n))  = 0 (y)m  < (>>)„)}  . 


424 


Chapter  6.  Determinacy 


Then  X is  L]  and  X C WF  in  the  terminology  of  13.6,  so  that  by  it  X C WF„ 
for  some  a < co\ . This  implies  that  the  length  of  < is  less  than  a>\ . ~\ 

Simple  conclusions  can  now  be  drawn  from  Second  Periodicity  30.8  for  the 
concept  of  y-Suslin  from  13.13: 

30.12  Exercise  (DC)(Moschovakis,  Kechris).  Assume  n e o>  and  DctfA),, ).  Then: 

(a)  Every  ^\n+2  set  is  8\n+l  -Suslin. 

(b)  Every  E^+i  set  is  y-Suslinfor  some  y < S*n+1. 

Hint.  For  (a),  it  suffices  by  13.13(d)  to  show  that  every  set  is  4+i  -Suslin, 

but  this  follows  from  30.9,  30.2,  and  30.11(a). 

For  (b),  it  suffices  to  show  that  every  II},,  set  is  y-Suslin  for  some  y < 8\n. 
For  such  a set  A,  there  is  by  30.9  a n^j-scale  on  A into  4+i-  Since  A is  in 
A2„+1,  each  of  the  prewellorderings  corresponding  to  norms  in  the  scale  is  actually 
in  \\n+l,  and  so  by  30.1 1(b)  there  must  be  a y < 4+1  bounding  all  their  lengths. 

H 


To  generalize  the  classical  Suslin  and  Sierpinski  results,  call  by  rj-union  the 
process  of  taking  the  union  [J{A|  | £ < ?;}  of  a (well-ordered)  sequence  of  sets 
(A|  | £ <17),  and  say  that  A C k(wa>)  is  y-Borel  iff  A is  in  the  smallest  collection 
of  sets  containing  the  open  subsets  of  k(°’co)  and  closed  under  complementation 
and  the  taking  of  ?;-unions  for  every  ;/  < y.  Thus,  A is  ft>i -Borel  iff  A is  Borel. 
Moreover,  for  any  y not  a cardinal,  A is  y -Borel  iff  A is  y+ -Borel. 

The  following  is  based  on  classical  arguments  going  back  to  Luzin-Sierpinski 
[18,23];  Martin  saw,  particularly  through  (b),  how  to  make  incisive  use  of  the 
generalizations. 

30.13  Exercise.  Suppose  that  A c k(a>co)  is  y-Suslin.  Then: 

(a)  A is  a y+-union  of  (y  + \)-Borel  sets,  and  so  is  (y+  + 1 )-Borel. 

(b)  If  cf(y)  > then  A is  y+ -Borel. 

(c)  If  B C k(fcS)  is  also  y-Suslin  and  AHB  — 0,  thenthereisa  (y  + l)-Borel 
set  C that  separates  A and  B,  i.e.  B C C and  AflC  = 0. 

Hint.  For  (a),  suppose  that  A = p[T\  where  T is  a tree  on  kco  x y.  Proceed 
as  in  the  proof  of  13.3,  first  defining  Bsa  analogously  for  s e <coy  and  a < y+ 
and  showing  by  induction  on  a that  they  are  (y  + l)-Borel.  Then  define  Ca 
analogously  for  a < y+,  and  show  that  they  too  are  (y  + l)-Borel.  Finally,  argue 
as  in  13.3  but  in  terms  of  complements  to  show  that 

A = Ua<y+(k(WCO)-Ca)  . 

For  (b),  amend  the  previous  argument  as  in  the  second  proof  of  13.7  (given 
after  13.14),  defining  — T (1  (k(<wco)  x <cu$)  for  $ < y and  observing  that 
A = U?<K  PiT as  cf(y ) > a>. 
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For  (c),  suppose  that  A — p[T\  and  B = p[U ] where  T and  U are  trees  on 
kco  x y.  Argue  as  for  13.4  with  k(a>co)  — A in  the  role  of  the  A there,  to  get  an 
a ' < y+  such  that 

B c {w  e k(0}a> ) | T„  is  well-founded  with  ||TW||  < a } . 

This  set,  in  a previous  notation,  is  (y  + l)-Borel  and  is  disjoint  from  A.  H 

The  following  is  a consequence  of  (c)  and  30.12(b). 

30.14  Exercise  (DC)(Moschovakis  [71]).  Assume  new  and  DetfA^).  Then 
every  A2„+1  set  is  8\n+l-Borel. 

Martin  used  his  analysis  of  £3  sets,  discussed  in  §15,  to  draw  an  unexpected 
conclusion  from  AD.  The  following  goes  much  of  the  way  to  that  result. 

30.15  Proposition  (Martin).  Assume  Vo  e :"o)(a:  exists).  If  cf(|«(U|)  > a>,  then 
every  £'  set  is  -Borel. 

Proof.  Every  £3  set  is  M^-Suslin  (15.12),  and  so  is  Mffl+-Borel  (30.13(b)).  But 
every  cardinal  < u a must  have  cofinality  cf(M2)  by  14.18(a),  and  so  it  is  readily 
seen  that  every  £3  set  is  in  fact  (u2  + 1 )-Borel.  To  complete  the  proof,  it  suffices 
to  observe  that  for  1 < n < co,  un  < ^3  (and  consequently  um  < by  30.11(b)). 
To  this  end,  note  that  each  u„  is  the  length  of  a prewellordering  in  A\\ 

By  14.18(b)  any  y < un  is  of  form  t^L^'€,afuu . . . , m„_i)  for  some  Skolem 
term  t and  a e aco.  Coding  pairs  (t,  a)  by  reals,  the  prewellordering  < given  by 

(t,  a)  < (t,  a)  iff  f<L[fl]-e'a>(Mi, . . . , it  11—  1 ) < t{Lral€’a)(uu  ■ ■ ■ , n„_i) 

is  then  A3  by  14.12(b)  relativized,  being  equivalent  to  c(a,b)#  containing  (a 
code  for)  a sentence  er(ci, . . . , c„_i)  depending  recursively  on  t and  7,  where 
c(<7,  b)  e w co  recursively  codes  a and  b.  H 

We  have  now  arrived  at  a notable  juncture.  Further  progress  cannot  be  made 
without  the  Coding  Lemma  of  Moschovakis  [70],  a technical  assertion  for  appli- 
cations that  necessarily  does  not  admit  a simple  statement.  Devised  in  the  study 
of  0 before  the  concept  of  scale  was  isolated,  the  lemma  incorporates  definability 
into  antecedents  of  H.  Friedman  and  Solovay  and  asserts  loosely  speaking  that 
bounded  subsets  of  0 and  functions  on  ordinals  less  than  0 are  definable.  The 
proof  requires  the  full  exercise  of  AD  in  that  winning  strategies  are  needed  for 
games  of  arbitrary  complexity,  and  for  the  incorporation  of  definability,  Kleene’s 
Recursion  Theorem.  This  intriguing  convergence  of  recursion  theory  and  a strong 
hypothesis  in  set  theory  was  perhaps  novel  at  first,  but  the  lemma  steadily  gained 
in  importance  through  various  applications  so  that  it  came  to  be  recognized  as  a 
cornerstone  of  the  theory  of  determinacy.  Moschovakis ’s  first  application  of  his 
lemma  was  to  establish  the  following  result: 


426 


Chapter  6.  Determinacy 


30.16  Theorem  (Moschovakis  [70:41]).  Assume  AD.  Then  for  1 < m e co  and 
rj  < Sjn,  any  rj-union  of  £*,  subsets  of  k(coco)  is  also  a set. 


See  Moschovakis  [80: 434ff]  for  a proof.  Incidentally,  in  ZFC  Martin’s  Ax- 
iom MA  + ->CH  + 3a  € = a>\)  also  implies  the  conclusion  for  m = 2 

(Martin-Solovay  [70:  §3]).  Before  the  emergence  of  scales  Martin  had  applied 
30.16  to  establish  the  converse  to  30.14  from  AD;  the  proof  is  given  assuming 
30.16,  partly  to  highlight  an  early  use  of  Wadge’s  Lemma  29.15: 


30.17  Theorem  (DC)(Martin). 

set  is  At 


2n+l' 


Assume  n e co  and  AD.  Then  ever y 8\n+l-Borel 


Proof.  By  the  definition  of  S^+j-Borel  set,  if  the  class  of  A2JI+1  subsets  of  *(“&>) 
is  closed  under  the  taking  of  /y-unions  for  every  ?;  < ^2n+l’  then  every  S])i+1-Borel 
set  is  A2,!+1.  It  thus  suffices  to  assume  that  the  class  is  not  so  closed  and  derive 
a contradiction. 

Suppose  then  that  rj  < S])i+1  is  least  such  that  there  is  an  fj-union  of  A2)I+1 
subsets  of  k(aco)  which  is  not  A2„+1,  and  let  A = [J|<  A|  be  such  an  ?;-union. 
By  30.16,  A is  We  ^rst  show  that  A has  a £2»+rnorm: 

As  i ) < 8\n+l,  it  is  simple  to  see  that  there  is  a A^+j  prewellordering  of 10 co 
with  length  /?,  say  with  rank  function  p.  For  each  £ < rj,  C 2(<u&>)  defined  by 


R$(x,z ) x e AP(Z)  A p(z)  = | 
is  seen  to  be  A2„+1,  so  again  by  30.16,  R c 2(®<y)  defined  by 

R(x,z ) **  x e Ap(z) 

is  Ljji+i,  being  the  ?;-union  of  the  Rfs. 

By  the  minimality  of  r],  for  any  § < rj  the  sets  Uc<f  \ and  Uf<|  A;  are 
Aj n+\,  and  so  by  a similar  argument  S,T  c 2 (OJoj)  defined  by 

S(x,z)  x i Uf<p(;:)At  . and 

T(x,z)  X i Uf<p(z)At  • 

are  both  T,\n+V 

Finally,  let  v be  that  norm  on  A defined  by: 


v(x)  = least  § such  that  x e A|  . 


Then 

x <*  y 3lz(R(x,  z ) A S(x,  z)) 
x <*  y **  3lz(R(x,  z)  A T(x,  z)) 

both  are  evidently  Ejh+i,  and  so  v is  a 2-2«+rnorm- 
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The  proof  is  completed  with  Wadge’s  Lemma  29.15.  For  any  L^+i  set  B °f 
reals,  either  A <w  mw  — B or  B <w  A.  But  the  first  alternative  readily  implies 
that  A is  Yl\n+X  contrary  to  assumption.  On  the  other  hand,  the  second  alternative 
readily  implies  that  B has  a Ljn+i  norm  induced  by  v.  Applying  homeomorphisms 
to  encompass  subsets  of  k(fw),  it  thus  follows  that  Ljn+i  has  the  prewellordering 
property,  contradicting  29.14.  H 

This  leads  to  that  unexpected  result  heralded  by  30.15: 

30.18  Theorem  (Martin).  Assume  AD.  Then  ua,  = 

Proof.  First  observe  that  for  1 < n e w,  AC iof°w)  together  with  the  existence  of 
a surjection:  Ww  —>  V(w„)  (28.15,28.16)  readily  imply  that  cf(o>„)  > w.  Flence, 
the  least  uncountable  cardinal  of  cofinality  co  is  wa. 

If  DC  were  assumed,  we  could  now  argue  as  follows:  Since  sharps  exist  under 
AD  and  there  are  Li  sets  that  are  not  Aj,  30.15  and  30.17  imply  that  cf(|M<y|)  = co. 
But  then,  the  previous  observation  together  with  co\  = u\  < < wm  imply  that 

Uq)  — CO  (O’ 

Finally,  the  use  of  DC  can  be  eliminated  by  analyzing  the  appeal  to  29.14  in 
the  argument  for  30.17:  First,  the  prewellordering  property  for  L^+i  is  derived. 
Since  the  prewellordering  property  implies  the  reduction  property  and  the  latter 
cannot  hold  for  both  L^+i  and  Il2n+1,  a contradiction  results  from  Il2„+1  having 
the  prewellordering  property.  It  is  only  this  last  appeal  to  29.14  that  depends 
on  DC.  However,  in  the  29.13  derivation  of  the  prewellordering  property  for  Ilj 
from  DefiAj),  DC  is  only  used  to  get  the  well-foundedness  of  the  prewellordering 
relation,  the  classical  prewellordering  property  for  L,  not  being  dependent  on  DC. 
The  other  properties  short  of  well-foundedness  suffice  for  the  29.7  derivation  of 
the  reduction  property  for  II3  from  the  prewellordering  property,  and  hence  the 
contradiction.  H 

This  is  the  result  from  which  was  derived  the  remarkable  consequence  of  AD 
that  cf (a>„)  = a>2  for  2 < n e co  (28.8),  and  indeed  that  un  = con  for  1 < n e co 
(28.9). 

The  following  are  two  characterizations  depending  on  30.16  that  generalize 
classical  results.  30.14  and  30.17  provide  a generalization  of  Suslin’s  result  12.1 
to  the  odd  levels  of  the  projective  hierarchy: 

30.19  Theorem  (DC)(Martin,  Moschovakis).  Assume  AD.  Then  for  new  and 
A C *("«), 

A is  A2»+i  W A is  82n+rB°reI  ■ 

Moschovakis  established  the  following  extension  of  Sierpinski’s  13.7  in  the 
wake  of  scales: 

30.20  Theorem  (DC)(Moschovakis  [71]).  Assume  AD.  Then  for  new  and 
A C *•'("«), 
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A is  S2„+2  iff  A is  a 8\n+l-union  of  A2n+ , subsets  of  k(mco)  . 

In  particular, 

A is  £3  ff  A is  a union  of  Ki  Borel  subsets  of  k(fa>)  . H 

See  Moschovakis  [80:  435]  for  a proof.  As  for  the  extent  of  closure  of  ^2,1+2 
instead  of  an  analysis  in  terms  of  simpler  sets,  Kechris  [78a]  used  this  theory  to 
observe  in  ZF  + DC  + AD  that  for  arbitrary  q any  q-union  ofL \n+2  subsets  of 
k(wco)  is  2^2„+2-  However,  this  may  be  vacuous  for  large  q,  with  the  following 
question  unresolved: 

30.21  Question  (DC).  Assume  AD.  Is  there  a sequence  of  distinct  £2/1+2  sets  °f 
reals  of  length  if2n+f+?  In  particular,  is  there  a sequence  of  £2  sets  °f  reals  of 
length  a>2? 

This  is  a major  open  question  in  the  area;  partial  results  appear  in  Jackson- 
Martin  [83]  and  Jackson  [90],  e.g.  there  no  such  sequences  that  are  increasing 
under  c. 

With  the  elegant  structure  results  30.19  and  30.20  recasting  the  projective 
sets  in  terms  of  well-ordered  unions  of  Borel  sets,  it  remained  to  get  an  extrinsic 
understanding  of  the  possible  lengths  of  these  unions,  i.e.  the  projective  ordinals. 
Recalling  that 

<$!  = op 

is  essentially  a classical  result  (30.1 1(b)),  the  following  summarizes  the  first  results 
that  situated  the  further  projective  ordinals  in  the  transfinite  under  AD: 

Moschovakis  [70]  had  used  his  Coding  Lemma  to  show  under  AD  that  the 
projective  ordinals  are  cardinals,  and  that  for  n e co,  8\n+x  is  regular  and  less  than 
&\n+2-  Martin  showed  (without  AD)  that  S2  < u>2  and  so  concluded  with  AD  that 

= a>2  ■ 

Using  his  analysis  of  £3  sets,  Martin  showed  (from  just  the  existence  of 
sharps)  that  ^3  < w^i,  and  so  concluded  with  AD  that 

£3  = U>w+\  , 

the  third  regular  uncountable  cardinal  under  AD.  Martin’s  results  followed  from  a 
basic  result  on  the  lengths  of  well-founded  relations  established  by  him  and  Kunen 
independently: 

30.22  Theorem  (DC)(Kunen;  Martin).  Suppose  that  < C 2('"o>)  is  a well-founded 
relation  which  is  K-Suslin.  Then  the  length  of  < is  less  than  k+ . 

The  following  is  then  a simple  consequence  of  30.12(a): 
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30.23  Corollary  (DC).  Assume  n e co  and  DeUA,,,).  Then  S2n+2  < (<$2„+i)+- 


It  follows  under  AD  that 

<$4  = tt>( 0+2  ■ 

Some  results  generalizing  properties  of  these  first  projective  ordinals  were 
soon  established.  Kechris  [74]  showed  that  for  n e a>,  <$2,!+1  is  the  cardinal 
successor  of  a cardinal  of  cofinality  a>  and  that  S]n+2  < <$2„+3.  Generalizing 
Solovay’s  early  measurability  results,  Martin  showed  that  for  each  n e a>,  &\n+\  is 
measurable,  and  then  Kunen  showed  that  <$2,1+2  also  measurable.  The  following 
summarizes  what  was  known  by  the  early  1970’s;  for  proofs  see  Kechris  [78]. 


30.24  Theorem  (DC).  Assume  AD.  Then  (<$,j  | 1 < n e a>)  is  a strictly  increasing 
sequence  of  measurable  cardinals  such  that  for  odd  n,  Sjt  is  the  successor  of  a 
cardinal  of  cofinality  co  and  <$(|+1  = (<$,|)+.  Moreover,  <$J  = a>i,  8\  — (02,  <$3  = 
tt>w+i,  and  <$4  = (om+2  are  the  first  four  regular  uncountable  cardinals. 


But  which  cardinal  is  <$5  under  AD?  This  became  the  focal  question,  one  that 
was  to  remain  unanswered  for  well  over  a decade.  It  was  felt  that  once  <5g  was 
determined,  the  techniques  developed  might  suffice  to  determine  all  the  projective 
ordinals.  Moreover,  the  question  was  related  to  a refined,  structural  approach  to 
the  Continuum  Problem:  Stepping  back  into  ZFC,  we  have  the  classical  8 } = a>\, 
and  Martin’s  <$]  < a>2 . Under  the  existence  of  sharps,  Martin’s  AC  result  15.13 
together  with  30.22  imply  that 

<$3  < ft>3  . 

(Some  assumption  is  necessary  here,  for  Harrington  [77]  established  the  consis- 
tency of  <$3  being  arbitrarily  large  in  ZFC.)  Mindful  of  how  the  consequences 
of  AD  were  increasingly  viewed  as  consequences  of  ADL(R)  in  L(R),  one  more 
deduction  can  be  made:  Note  first  that  the  notions  having  to  do  with  projective 
sets  like  the  <$,'  ’ s are  readily  seen  to  be  absolute  for  L(R).  Assuming  ADL(R)  and 
comparing  the  fourth  regular  uncountable  cardinals  in  L(R)  and  in  V, 


ol  _ ,L(R) 
°A  — Ww+2 


< OJ4 


Here  was  the  new  approach  to  the  Continuum  Problem.  With  the  projective 
ordinals  being  definability  analogues  of  © construed  as  the  length  of  the  contin- 
uum, it  may  be  that  ADi(R>  imposes  similar  upper  bounds  on  all  the  projective 
ordinals.  A version  of  a 1970  conjecture  of  Martin’s  is  that  the  hypothesis  implies 
that  for  every  1 < n e a>,  <5,j  < a>„.  But  even  8\  < a>m  could  not  be  established. 

How  these  issues  came  to  be  resolved  in  the  1980’s  is  discussed  at  the  end 
of  this  section.  Although  the  ground  was  to  lie  fallow  after  1971  for  well  over  a 
decade,  the  remarkable  progress  made  by  Kunen  in  that  year  should  be  mentioned. 
Beyond  what  was  chronicled  above,  he  proceeded  to  “dualize”  the  tree  represen- 
tation of  E3  sets  to  get  one  for  the  II3  sets  (as  suggested  at  the  end  of  §15),  and 
forging  ahead,  similarly  comprehended  the  II4  sets  (see  Kechris  [81]  and  Solovay 
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[78]).  This  work  stood  as  the  high  point  of  the  contextually  optimal  analysis  of 
projective  sets  until  further  progress  could  be  made  by  the  determination  of  <5^ . 

The  Extent  of  Scales 

Having  developed  the  theory  of  scales  and  projective  ordinals  available  in  the 
early  1970’s  this  section  is  brought  to  a close  with  a survey  of  the  further  work, 
starting  with  the  investigation  of  the  extent  of  scales  beyond  the  projective  classes. 
As  Steel  [83: 107]  put  it. 


Scales  are  important  in  Descriptive  Set  Theory  because  they  provide  the  only  known 
general  method  which  will  take  arbitrary  definitions  in  a given  logical  form  of  sets  of 
reals,  and  produce  definitions  of  members  of  those  sets. 


Using  a process  known  as  inductive  definability  Moschovakis  [78]  formulated 
the  class  IND  of  inductive  sets  as  a natural  extension  of  the  projective  sets.  For 
a succinct  characterization,  A C k(a>co)  is  inductive  iff  there  is  an  open  set  B C 
such  that 

A(w)  **  [V1Xo31XiV1X231X3  . . .}5(w,  (Xo,  X\,  X2,  *3,  ...)), 

where  the  real  game  quantifier  xfif  X\1X  xff1  X3 . . .}  has  the  expected  interpre- 
tation and  { xq , X\,X2,  X3, . . .}  is  that  real  x such  that  (x),-  = x,  for  every  i e to. 
Applying  (a  general  form  of)  his  Second  Periodicity  Theorem  30.8  Moschovakis 
showed  under  Det(AiND)  that  the  inductive  sets,  both  in  boldface  and  lightface 
forms,  have  the  scale  property.  (See  Moschovakis  [80:  7C].)  After  a lull  of  sev- 
eral years,  the  extent  of  definable  scales  was  considerably  clarified  as  outlined  in 
Martin-Moschovakis-Steel  [82]: 

Moschovakis  [83]  realized  that  with  sufficient  determinacy  members  of  the 
dual  class  of  coinductive  sets  have  scales  definable  not  much  beyond  inductivity 
in  complexity.  Martin  [83]  noted  however  that  this  complexity  cannot  be  re- 
duced in  general.  Building  on  Moschovakis’s  construction  Martin-Steel  [83]  then 
demarcated  the  extent  of  scales  in  L(R): 

With  third-order  arithmetic  A 3 the  expected  expansion  of  A2  incorporating 
Vi01  co)  with  new  quantifiers  V2  and  32  ranging  over  that  domain,  A C k(‘°co)  is 

^ iff 

A(w)  **  A2  |=  32X<p[w]  , 

where  cp  is  a formula  with  only  real  and  number  quantifiers.  The  dual  class  77 12 
is  defined  analogously  with  V2  in  place  of  32,  and  the  incorporation  of  parameters 
a e m co  leads  to  the  boldface  versions  £2  and  II2.  Correlating  with  the  inductive 
sets  Martin-Steel  [83]  first  showed  that  A is  (L2)i(R)  (i.e.  32  is  restricted  to  T(ma>)C\ 
L(R))  iff  there  is  an  Ilj  set  B c k+l(‘°(d)  such  that 


A(w)  **  [V1xo31xiV1X231X3  . . .}B(w,  (xo,  xi,  X2,  X3, . . .))  . 
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They  then  established:  Assume  AD  and  V — L(R).  Then  E2  has  the  scale  prop- 
erty. This  is  indeed  a demarcation,  by  a 1976  observation  of  Kechris  and  Solovay 
based  on  an  early  counterexample: 

30.25  Exercise  (Kechris  and  Solovay).  Suppose  that  V = L(R),  and  there  is 
no  well-ordering  of  the  reals.  Then  there  is  a Ilf  subset  of  2(‘°co ) that  cannot  be 
uniformized. 

Hint.  Verify  that  the  set  A of  27.16  is  a counterexample  by  checking  that  1 y e 
OD(v)l  is  Xj2:  First  use  a Lowenheim-Skolem  argument  to  show  that  definability 
using  parameters  from  OnU  (x)  in  L(R)  is  equivalent  to  such  definability  in  Ly(R) 
for  some  y < 0.  (This  recalls  the  proof  of  the  Reflection  Principle  for  ZF  in 
its  role  for  formalizing  the  concept  of  ordinal  definability.  Remembering  that 
cf(@)  > co  in  L(R),  an  argument  in  to  stages  minimizing  according  to  a definable 
surjection:  On  x 01  oo  — »•  L(R)  is  helpful  to  get  y < &.)  Then  note  that  any  Ly(R) 
for  y < 0 can  be  coded  by  a set  of  reals.  H 

In  further  work  Martin  [83a]  showed  that  assuming  corresponding  versions 
of  ADr,  if  a reasonably  closed  class  A has  the  scale  property,  then  so  does  the 
class  consisting  of  sets  defined  via  the  real  game  quantifier  preceding  A relations, 
and  moreover  via  analogous  quantifiers  corresponding  to  games  of  any  countable 
length.  A high  plateau  was  reached  by  Steel  [83]  who  developed  a “fine  structure” 
theory  for  L(R)  and  considerably  refined  the  previous  results.  Analyzing  the 
minimal  complexity  of  scales  in  L(R),  Steel  essentially  extended  the  structure 
theory  for  the  projective  sets  provided  by  determinacy  to  all  sets  of  reals  in  L(R). 

In  the  next  advance  upward,  Woodin  proved  in  the  mid-1980’s  that  ADr 
implies  that  eveiy  set  of  reals  has  a scale.  This  implies  by  the  previous  work  of 
Martin  that  there  are  nonselfdual  classes  arbitrarily  high  in  the  Wadge  ordering  <w 
with  the  scale  property  and  reasonable  closure  properties.  Woodin  also  established, 
relative  to  large  cardinals,  the  consistency  of  the  existence  of  an  inner  model 
containing  every  real  and  satisfying  ADr  - but  here  we  are  treading  into  matters 
taken  up  in  §32. 

The  Structure  of  L(R) 

With  L(R)  seen  from  the  beginning  as  the  natural  inner  model  for  determinacy, 
not  only  were  the  consequences  of  AD  increasingly  regarded  as  what  holds  in 
L(R)  assuming  ADL(R),  but  the  structure  of  the  inner  model  came  to  be  studied 
for  its  own  sake  and  as  an  avenue  to  relative  consistency  results.  In  contrast  to  the 
investigation  of  L which  can  be  regarded  as  the  extended  analysis  of  a minimal 
principle,  this  study  of  L(R)  sought  clarification  of  a maximal  principle  in  sharp 
focus.  The  sophisticated  Steel  [83]  analysis  of  scales  in  L(R)  was  the  high  point 
of  the  internal  analysis,  and  was  to  serve  as  a basis  for  the  wide-ranging  assertions 
about  AD  in  L(R)  that  resolved  various  issues  raised  in  the  1970’s.  Kechris  [85] 
provides  a summarizing  account.  As  already  with  the  extent  of  scales,  the  short 
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discussion  which  follows  can  convey  little  of  the  depth  and  complexity  of  the 
results  being  surveyed. 

As  described  in  §28  determinacy  has  a strong  effect  on  the  structure  of  the 
transfinite  cardinals  up  to  the  natural  bound  0.  Taking  up  again  the  discussion  of 
the  infinite  exponent  partition  relations, 

k has  the  strong  partition  property  iff  Va  < k(k  — >■  (k)£)  . 

By  Martin’s  28.13,  AD  implies  that  a>\  has  the  strong  partition  property.  For  some 
time  there  was  no  other  example  known,  the  focal  question  of  whether  S]  has  the 
strong  partition  property  remaining  open.  Then  in  late  1977  Kechris  bypassed  the 
projective  ordinals,  and  taking  features  of  Martin’s  proof  and  using  properties  of 
the  class  IND  of  inductive  sets,  established  that  under  AD  + DC  there  is  a cardinal 
k with  the  strong  partition  property  such  that  {A.  < k \ X has  the  strong  partition 
property  } is  stationary  in  k.  The  argument  was  soon  extended  to  establish  the 
following: 

30.26  Theorem  (DC)(Kechris-Kleinberg-Moschovakis-Woodin  [81]).  Assume 
AD.  Then  © is  a limit  of  cardinals  having  the  strong  partition  property. 

These  cardinals  with  the  strong  partition  property  are  analogous  to  u>\  in  its 
role  as  Sj  in  the  proof  of  Martin’s  28.12,  and  a uniform  version  of  the  Moschovakis 
Coding  Lemma  was  used  to  code  functions:  k ->  k by  reals,  replacing  Martin’s 
coding  via  sharps.  The  possibility  of  a converse  in  L(R)  was  raised,  and  this  was 
established  by  Kechris  and  Woodin  in  1982: 

30.27  Theorem  (DC)(Kechris- Woodin  [83]).  Assume  V = L(R).  Then  the  fol- 
lowing are  equivalent: 

(a)  AD. 

(b)  0 is  a limit  of  cardinals  having  the  strong  partition  property. 

This  illuminating  characterization  reduced  AD  to  a purely  combinatorial  as- 
sertion about  the  transfinite  cardinals,  and  vindicated  the  early  interest  in  infinite 
exponent  partition  properties.  Kechris-Kleinberg-Moschovakis- Woodin  [81]  had 
already  shown  that  if  a set  A of  reals  is  X-Suslin  (i.e.  A — p[T]  for  some  tree  T on 
cu  x A.)  and  there  is  a k > A.  with  the  strong  partition  property,  then  A is  determined. 
Loosely  speaking,  the  tree  representation  was  used  as  in  the  § 1 5 analysis  of 
sets,  and  the  strong  partition  property  applied  to  get  an  invariance  for  ordering 
functions  (cf.  the  end  of  §15).  With  this,  they  had  achieved  a weak  version  of 
30.27.  What  Kechris- Woodin  [83]  established  is  that  in  L(R),  if  every  set  of  reals 
X-Suslin  for  some  X is  determined,  then  AD.  Proceeding  level  by  level  through 
the  fine  structure  hierarchy  of  L(R),  the  argument  used  the  Steel  [83]  analysis  of 
scales,  in  their  role  of  witnessing  the  Suslin  property,  and  applied  a technique  of 
Martin  [83]  for  handling  quantifier  alternations  over  E using  his  filter  over  Turing 
degrees.  Woodin  later  confirmed  that  (b)  cannot  imply  (a)  in  ZF  + DC  alone,  by 
showing  that  (b)  is  consistent  with  the  existence  of  an  ultrafilter  over  a>. 


30.  Scales,  Projective  Ordinals,  and  L( R) 


433 


Another  characterization  established  around  the  same  time  addressed  the  in- 
volvement of  Turing  degrees  in  determinacy.  Consider  Turing  Determinacv, 

Det({UV  I V c VT})  , 

i.e.  the  hypothesis  that  every  set  of  reals  closed  under  Turing  equivalence  =t  is 
determined.  Turing  Determinacy  can  replace  full  AD  in  Martin’s  result  28.4  that 
his  filter  over  Turing  degrees  is  ultra  and  at  various  junctures  (cf.  the  proofs  of 
First  Periodicity  29.13),  and  the  question  of  its  strength  was  raised  early  on.  In  the 
early  1980’s  Woodin  pushed  the  techniques  that  had  become  available  to  establish 
the  following: 

30.28  Theorem  (DC)(Woodin).  Assume  V = L( R).  Then  the  following  are 
equivalent: 

(a)  AD. 

(b)  Turing  Determinacy. 

A decade  later  Woodin  was  to  establish  an  equivalence  in  L(R)  for  AD 
merely  in  terms  of  some  of  its  structural  consequences  for  sets  of  reals  (32.22). 

A more  basic  issue  had  to  do  with  DC.  With  much  of  the  consequences  of 
AD  appealing  to  DC,  how  substantive  a role  does  DC  play  was  a question  often 
raised  during  the  1970’s.  Does  Con(ZF  + AD)  imply  Con(ZF  + AD  + DC)?  As 
mentioned  after  28.19,  Solovay  [78a]  established  that  Con(ZF  + ADr)  implies 
Con(ZF  + ADr  h — 'DC),  and  hence  the  independence  of  DC  from  AD  relative  to 
ADr.  It  was  Kechris  who  eventually  settled  the  consistency  question: 

30.29  Theorem  (Kechris  [84]).  Assume  AD  and  V — L(R).  Then  DC. 

Thus  the  role  of  L(R)  as  an  inner  model  for  ZF  + AD  became  more  fully 
analogous  to  the  role  of  L for  ZF.  Just  as  ZF  + V = L procures  AC,  ZF  + AD 
+ V — L(K)  procures  DC,  and  so  indeed  Con(ZF  + AD)  implies  Con(ZF  + AD 
+ DC)  by  absoluteness.  The  proof  also  used  the  Steel  [83]  analysis  of  scales,  in 
their  role  of  securing  uniformization  as  a means  to  establish  DC. 

Woodin  was  able  to  replace  ADR  by  AD  in  Solovay’ s DC  independence 
result: 

30.30  Theorem  (Woodin  - Kechris  [84:  §3]).  Assume  AD  and  V — L(R).  Then 
in  a forcing  extension  there  is  an  inner  model  satisfying  AD  H — 'AC^,. 

Thus,  Con(ZF  + AD)  implies  Con(ZF  + AD  -l — 'DC)  and  the  independence 
of  DC  relative  to  AD. 

Woodin’s  counterexample  to  ACW  occurs  high  in  the  cumulative  hierarchy 
however,  and  toward  the  formulation  of  a refined  question  consider  DC  restricted 
to  relations  on  reals: 
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(DCr&O)  WR(R  C 2Cw)  A V'xl'y {{x,  y)  € R) 

->  3/  e a,r«)V°»((/(«),  / (n  + 1)}  e /?))  . 

30.31  Question  (Woodin).  Does  AD  imply  DC  (fed)? 

An  independence  result  here  would  seem  to  require  inner  models  of  AD 
qualitatively  different  from  the  onces  concocted  thus  far. 

Yet  another  issue  clarified  by  L(R)  is  the  size  of  0.  Recall  from  §28  that 
under  AD,  there  are  cardinals  k that  are  k -weakly  Mahlo  below  0,  and  under  AD 
+ V — L(R),  0 itself  is  weakly  inaccessible  (28.18  and  before,  and  28.19). 

30.32  Theorem  (DC)(Kechris  and  Woodin  - Kechris  [85]).  Assume  AD  and 
V — L(W).  Then: 

(a)  0 is  0-weakly  Mahlo. 

(b)  0 is  not  weakly  compact,  i.e.  there  is  a 0-tree  without  a 0-branch. 

(c)  0 is  the  supremum  of  the  measurable  cardinals. 

(a)  established  that  0 is  hierarchically  large  in  L(R),  but  (b)  imposed  a 
limitation,  (c)  provided  a purely  combinatorial  description  of  0,  and  was  a con- 
sequence of  the  arguments  toward  (a);  a question  related  to  it  is  stated  at  the  end 
of  the  section. 

The  Determination  of  the  Projective  Ordinals 

The  structural  analysis  under  AD  of  the  projective  sets  was  essentially  completed 
in  the  mid-1980’s  with  the  calibration  of  the  projective  ordinals  in  the  sequences 
of  cardinals.  This  was  a veritable  tour  de  force  of  technical  virtuosity  by  Steve 
Jackson,  a student  of  Martin.  More  in  the  fullness  of  time  rather  than  by  the  weight 
of  new  techniques,  Jackson  determined  <5^  in  his  1983  U.C.L.A.  dissertation.  The 
starting  point  was  some  observations  of  Martin  on  normal  ultrafilters  over  <5| 
leading  to  a putative  lower  bound  for  <5^ . By  1985  Jackson  had  carried  out  the 
determination  of  all  the  projective  ordinals  with  the  large  part  of  the  upper  bound 
calculations  provided  in  his  formidable  [88],  30.24  summarizes  what  had  been 
known  before  with  &\n+2  = (^2«+i)+  reducing  the  analysis  to  the  <$^’s  for  odd 
m;  define  ordinals  E(n)  for  n e a>  by  recursion  as  follows:  /AO)  = 1,  and 
E(n  + 1)  = coE(n)  via  ordinal  exponentiation. 

30.33  Theorem  (DC)(Jackson  [88]).  Assume  AD.  Then  for  n e co, 

S2n+3  = WE(  2«+l)+l 

and  <52„+3  has  the  strong  partition  property. 

The  Sjfs  for  odd  m are  thus  the  successors  of  cardinals  indexed  by  a cor- 
responding tower  of  <y’s,  starting  with  Martin’s  <$3  = <ww+i  (cf.  before  30.22). 
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That  they  have  the  strong  partition  property  is  both  an  outgrowth  and  a crucial 
component  of  the  proof,  which  proceeds  by  induction.  Beyond  the  Martin  result 
28.13  that  (5}  has  the  strong  partition  property,  even  that  <5|  has  the  strong  parti- 
tion property  had  not  been  known.  The  inductive  scheme  can  be  elaborated  at  the 
basis: 

As  motivated  through  methodological  considerations  at  the  end  of  §15,  Kunen 
had  applied  Martin's  strong  partition  property  for  Sj  to  get  ultrafilters  over  cow  (= 
Mffl)  that  provide  the  necessary  homogeneity  properties  for  the  contextually  optimal 
tree  representation  of  11  1 sets.  Forging  ahead,  Kunen  carried  out  a detailed  analysis 
of  these  ultrafilters  and  their  ultrapowers  (see  Solovay  [78]).  This  led  to  the 
property  5]  — > (5j)“  for  every  a < S],  which  in  turn  provided  the  ultrafilters 
over  <5j  (cf.  28.10)  necessary  for  the  optimal  tree  representation  of  II4  sets. 

<5^  is  characterizable  in  terms  of  a certain  supremum  of  ultrapowers  via  those 
ultrafilters  over  <$3  (see  Kechris  [8 1 : 62]),  and  Martin  had  noted  that  only  certain 
canonical  functions  need  be  considered  in  the  calculation  of  this  supremum.  Ap- 
plying this  Jackson  was  able  to  complete  the  calculation  by  an  intricate  analysis 
that  further  reduced  the  supremum  to  one  of  ultrapowers  via  certain  canonical  ul- 
trafilters. In  the  process,  he  was  able  to  lift  the  Kunen  situation  to  the  next  level: 
the  strong  partition  property  for  <5|  (not  just  the  “weak”  one  mentioned  above)  and 
a detailed  analysis  of  the  ultrafilters  over  5]  and  their  ultrapowers  that  leads  to  the 
optimal  tree  representation  of  II5  sets.  With  sufficient  inductive  hypotheses  thus 
in  place  Jackson  completed  the  calculation  of  all  the  projective  ordinals,  getting 
the  optimal  tree  representations  for  all  the  projective  sets. 

The  Jackson  analysis  confirmed  that  under  AD  the  regular  cardinals  strictly 
between  <$2„+1  and  ^2n+3  are  exactly  the  ultrapowers  of  &\n+x  via  the  normal  ul- 
trafilters over  o2n+\  derived  from  its  strong  partition  property.  Moreover,  these 
ultrapowers  are  distinct,  and  by  28.11  there  are  just  as  many  as  there  are  regular 
infinite  cardinals  below  S2n+]  ■ With  this  in  mind,  define  F:  co  — »•  a>  by:  ,F(0)  = 1 
and  F{n  + 1)  = 2 F(n)  + 1.  Then  under  AD  there  are  exactly  F{n ) regular  infinite 
cardinals  below  S^i+i  for  every  n e a>: 

This  is  clear  for  n = 0,  and  for  n — 1,  with  the  F(\)  = 3 cardinals  below 
S]  = ®w+1  being  co,  an,  and  &>2  • Assuming  inductively  that  there  are  F(n)  regular 
infinite  cardinals  below  it  follows  that  there  are  F{n)  regular  cardinals 

strictly  between  ^2n+1  and  <$2,!+3,  and  so  counting  the  regular  infinite  cardinals 
below  8\ n+\  and  <i\n+\  itself,  there  are  exactly  2 F(n)  + 1 = Fin  + 1)  regular 
cardinals  below  <$2„+3- 

The  Jackson  analysis  thus  established  in  ZFC  that  under  ADi(R)  the  projective 
ordinals  are  all  less  than  coa.  In  fact,  noting  once  again  that  the  S^’s  are  absolute 
for  L(R)  and  comparing  regular  cardinals  there  and  in  V,  the  list  of  inequalities 
after  30.24  can  be  continued  assuming  ADL(K)  with  the  following  for  every  n e w: 

&2n+l  < %W  and  &\n+2  - a>F(n)+ 1 ■ 

In  particular,  the  projective  sets  are  comprehended  in  ZFC  + ADi(R)  through  the 
characterizations  30.19  and  30.20  as  unions  of  X Borel  sets  with  the  k’s  concretely 
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inset  among  the  <w„’s.  This  is  a remarkable  advance,  mediated  by  determinacy, 
beyond  the  early  results  of  Suslin  and  Sierpinski. 

Taking  stock  however,  the  following  version  of  an  early  conjecture  of  Martin 
remains  unresolved: 

30.34  Question  (ZFC).  Does  ADi(R)  imply  that  < o>„  for  every  1 < n e u>? 

Jackson’s  analysis  raised  the  possibility  of  a similarly  detailed  analysis  of 
L(W)  right  up  to  0,  with  the  first  excursions  made  by  him  in  [91, 92],  A question 
thought  to  be  along  these  lines  was  answered  in  1993  by  Steel  and  Woodin,  in  the 
wake  of  important  advances  made  by  Steel  in  inner  model  theory.  Moschovakis 
and  Kechris  had  shown  that  under  ZF  + ADi(R),  every  regular  (in  V)  uncount- 
able cardinal  below  0i(R)  is  measurable  in  L(1R).  Jackson’s  work  affirmed  that 
under  AD  + DC,  every  regular  uncountable  cardinal  below  sup  ({ S | n e co})  is 
measurable. 

30.35  Theorem  (ZFC)  (Steel  and  Woodin).  Assume  AD  + V — L(M).  Then  eveiy 
regular  uncountable  cardinal  below  0 is  measurable. 

Taken  together,  the  various  results  through  the  early  1980’s  on  internal  ques- 
tions about  determinacy  and  the  structure  of  L(R)  provide  a cumulative  impression 
of  systematic  progress  using  more  and  more  sophisticated  methods  of  descriptive 
set  theory.  The  consistency  of  AD  from  the  vantage  point  of  ZFC  seemed  ever 
remote  however,  but  surprisingly  a denouement  in  this  direction  was  to  soon  take 
place,  and  this  notably  through  new  initiatives  involving  supercompact  cardinals 
as  described  in  §32. 
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The  consequences  of  determinacy  having  been  explored  in  the  previous  sections, 
this  one  begins  the  final  ascent  to  the  consistency  of  determinacy  hypotheses. 
Martin  forged  the  initiative  in  this  direction,  and  starting  afresh  with  his  ground- 
breaking work  in  1967  his  results  through  the  1970’s  are  described.  They  provided 
the  first  solid  evidence  towards  Solovay’s  general  conjecture  that  large  cardinals 
would  provide  the  consistency  strength  to  establish  determinacy.  The  next  section 
describes  the  definitive  work  of  Martin,  Steel,  and  Woodin  in  this  direction. 

Returning  to  safe  haven, 

ZFC  serves  as  the  ambient  theory  for  this  and  the  next  sections  . 

Proofs  are  increasingly  dispensed  with,  as  in  any  case  they  are  precluded  by  their 
range  and  sophistication,  in  favor  of  providing  a driving  account  of  the  progression 
of  ideas. 

In  1967,  Martin  struck  on  the  fertile  idea  of  using  &>i -complete  ultrafilters, 
viewed  as  two-valued  measures,  to  “integrate”  over  possibilities  to  get  structural 
results  about  the  projective  sets.  For  this  purpose  he  had  introduced  his  filter 
over  Turing  degrees  (cf.  the  first  proof  of  29.13),  and  this  had  led  to  his  proof 
of  Solovay’s  result  from  AD  that  a>\  is  measurable  (cf.  28.4).  The  integration 
idea,  mediated  by  the  sharps,  filtered  into  the  Martin-Solovay  [69]  work  on  Li 
sets  (§15),  and  was  to  plainly  appear  in  the  next  result.  That  Det(Ilj)  cannot 
be  established  in  ZFC  was  clear  (27.13);  Martin  established  that  it  does  follow 
from  the  existence  of  a measurable  cardinal.  Obliquely  complementing  Solovay’s 
deduction  of  measurability  from  AD  Martin’s  result  opened  the  door  to  the  pos- 
sibility of  getting  amounts  of  determinacy  from  large  cardinals,  a possibility  that 
was  to  be  pursued  with  remarkable  and  unexpected  consequences  in  the  years  to 
come. 

31.1  Theorem  (Martin  [70]).  Suppose  that  there  is  a measurable  cardinal.  Then 
Det(IlJ)  holds. 

Proof.  Suppose  that  A c 01  co  is  IlJ,  so  that  there  is  a tree  T on  co  x co  such  that 
for  any  x e wco, 

A(x)  **  Tx  is  well-founded  . 

That  A is  determined  will  be  argued  in  terms  of  well-orderings,  and  to  this  end, 
the  development  toward  the  beginning  of  § 1 5 is  adapted. 

First,  it  can  be  assumed  that  (0,  0}  e T . With  the  fixed  recursive  enumeration 
(s,-  | i e co)  of  <0}co  such  that  |s,j  < i,  for  any  x e “’co  let  <x  be  that  strict  linear 
ordering  of  co  defined  by: 

i <x  j -o-  s,'  ^ Tx  A s j £ Tx  A i < j ; else 
S;  ^ Tx  a s j e Tx  ; else 
Si  e Tx  a s j e Tx  a s,-  <kb 
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where  <kb  is  the  Kleene-Brouwer  ordering.  Then  for  any  x e "'co, 

A(x)  **  <v  is  a well-ordering  . 

Next,  for  s e <wco,  setting 

T{s)  = [w  | 3m  < |s|((s|m,  w)  e T)} 

let 

<s  = that  strict  linear  ordering  of  |s|  = {/  | i < |s|} 
defined  like  <x  but  with  Tx  replaced  by  . 

| Si  | < i implies  that  for  s c t both  in  <a>co,  <s  c <f  as  relations,  and  for  x e 

<x  = Umew  <x\m  ■ 

Finally,  suppose  that  at  is  a measurable  cardinal,  and  define  a tree  T*  on 
co  x k by 

T*  = {(s,  u)  | V/,  j < |s|(k(i)  < u(j)  i <s  j )}  . 

Then  for  any  x e 10 co,  since  <v  is  a well-ordering  exactly  when  there  is  an  order- 
preserving injection  of  (co,  <x)  into  (at,  <}, 

A(x)  ^ 3 g e“K((x,g)  e[T*])  . 

Proceeding  to  the  main  argument,  to  establish  that  Gt0(A)  is  determined 
consider  an  auxiliary  game  G*: 

/ : <x(0),g(0)>  <*(2),s(l)> 

II:  x(l)  x(3) 

I and  II  both  play  integers  x(i)  to  form  a real  x,  but  in  addition,  I plays  ordinals 
g(i ) < k to  form  a function  g:  co  — >■  k.  I wins  if  (x,  g)  e [T*],  so  that  g is  an 
order-preserving  injection  of  (co,  <x)  into  (at,  <}  and  x e A,  and  otherwise  II 
wins. 

If  I loses  in  G* , then  he  has  lost  irrevocably  at  a finite  stage:  He  will  have 
made  a move  (x(2i),  g(i))  for  some  i > 0 such  that  the  choice  g(i)  does  not 
preserve  the  order  of  <x,  and  this  can  already  be  seen  in  terms  of  the  initial 
segment  <x\2i  (as  |s,j  < i < 21).  Hence,  by  27.1  (cf.  the  comments  following) 
G*  is  determined.  If  it  is  I who  has  a winning  strategy  in  this  game,  then  he  has 
a winning  strategy  in  Gm(A)\  he  simply  keeps  the  g(i)’s  of  his  moves  to  himself 
and  plays  only  the  x(2i)’s.  If  it  is  II  who  has  a winning  strategy  in  G*,  it  remains 
to  invoke  the  measurability  of  k to  show  that  he  has  a winning  strategy  in  GW(A ): 

We  first  adapt  the  development  after  15.4:  Note  that  T*  has  the  following 
homogeneity  property:  For  any  s e <a)co,  set  T*  — {u  \ (s,  u)  e T*}.  Then  since 
<s  is  a strict  linear  ordering,  ran(«)  e [a-]1*1  for  any  u such  that  u e T*.  Moreover, 
for  any  w e [/r]|s|  there  is  a unique  u e T*  such  that  ran(u)  = w. 

With  this  in  mind,  let  U be  a normal  ultrafilter  over  k and  for  .?  e <0>co  define 
Us  by: 

X e Us  iff  X c Ts*  A 3H  e U(\H\M  c ran“Z)  . 
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Then  for  s f 0,  by  the  above  described  homogeneity  and  Rowbottom’s  7.17,  Us 
is  a k -complete  ultrafilter  over  T* . 

Suppose  now  that  r is  a winning  strategy  for  II  in  G*.  For  n e co,  t e 2"+1co, 
and  u e "+1at,  let  r (t,  u)  denote  the  strategic  response  according  to  r to 

((t  (0),u(0)),t(l (t(2n),  «(n)»  . 

Then  for  such  t and  k e co,  set 

Zt,k  = {u  e 7f|(n+i)  I r(t,  u)  = k}  . 

Next,  define  a strategy  r0:  Une®2"+1<w  to  by: 

r”o(t)  — k iff  Zjjr  g Ut\(n+\)  . 

This  is  well-defined  by  the  co\  -completeness  of  the  Uf  s.  The  proof  is  completed 
by  showing  that  r0  is  a winning  strategy  for  II  in  GW(A): 

Assume  to  the  contrary  that  x is  a play  according  to  r0,  yet  x e A.  By  the 
w\  -completeness  of  U there  is  an  Hq  e U such  that  for  every  n e co, 

[77o]  C Zv|(2n+l),r0(s|(2n+l))  • 

Since  H0  is  uncountable,  there  is  an  order-preserving  injection  g : (co,  <x)  -* 
(Hq,  <).  But  then,  (x,  g)  is  in  T*  and  corresponds  to  a play  of  G*  in  which  II 
plays  according  to  r and  still  loses.  Contradiction!  H 

As  in  the  analysis  of  Lj  sets  (§15)  the  hypothesis  can  be  weakened  to  the  existence 
of  sharps: 

31.2  Theorem  (Martin).  Suppose  that  a e 10 co  and  a#  exists.  Then  Det(/7j  (a)). 

Proof.  The  previous  argument  is  refined.  Construing  the  A as  Tl\(a),  it  can  be 
assumed  that  the  corresponding  tree  T is  in  L[a\  by  13.1.  Now  define  the  tree 
T*  from  T as  before  but  with  (the  real)  co\  in  place  of  the  measurable  k.  T*  and 
so  (the  payoff  set  of)  the  corresponding  game  G*  are  both  in  L\a],  and  arguing 
as  before,  the  game  is  determined  in  L[a]. 

Suppose  first  that  in  L[a],  I has  a winning  strategy  a in  G* . Then  in  V,  a is 
also  a winning  strategy  for  /,  since  if  he  moves  according  to  o and  still  loses,  this 
would  have  happened  at  a finite  stage,  and  so  also  in  L[a\.  It  follows  as  before 
that  I has  winning  strategy  in  GW(A). 

Suppose  on  the  other  hand  that  in  T[a],  II  has  a winning  strategy  r in  G*. 
Let  PT  consist  of  those  finite  partial  plays  according  to  r that  are  not  yet  secured 
for  II,  i.e.  those  that  are  extendible  to  a play  that  is  a win  for  I.  As  I loses  at  a 
finite  stage  when  he  loses  at  all,  it  follows  that  in  L[a],  (PT,  D)  is  well-founded. 
But  then,  by  absoluteness  of  well-foundedness  (0.3),  (PT,  D)  is  well-founded  in 
V.  r is  consequently  a winning  strategy  for  II  in  the  sense  of  V.  It  remains  to 
derive  a winning  strategy  for  II  in  GW{A): 
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With  Ia  the  closed  unbounded  class  of  ordinal  indiscernibles  for  L[«]  given 
by  a#,  let  y < a>\  be  sufficiently  large  so  that  in  some  Skolem  term  rendering 
of  r,  no  indiscernible  in  C — Ia  fl  (a>i  — y)  appears.  Now  define  a strategy 

ro;  U,,e(U2',+1«->  «by: 

r 0(r)  = A'  iff  3m  e "+1C((t|(«  + 1),  u)  e T*  A r (f,  u)  = k)  , 

where  r (f,  u)  is  defined  as  before;  ro  is  well-defined  by  indiscemibility.  Again, 
to  is  a winning  strategy  for  II  in  G m(A): 

Assume  to  the  contrary  that  x is  a play  according  to  to  yet  x e A.  Since  C 
is  uncountable,  there  is  an  order-preserving  injection  g:  (co,  <x)  — * (C,  <}.  But 
then,  by  indiscemibility  (x,  g)  corresponds  to  a play  of  G*  in  which  II  played 
according  to  r yet  lost.  Contradiction!  H 

Note  that  this  result  together  with  27.14  provides  a new  proof  of  the  version 
of  Solovay’s  result  14.3  that  if  Wa  e mco(a#  exists),  then  the  £'  sets  have  the 
regularity  properties. 

Martin  viewed  his  result  as  a plausibility  argument  in  two  complementary 
directions.  Looking  upward  he  wrote  prophetically  ([70:287]):  “We  believe  that 
larger  cardinals  will  yield  a generalization  of  our  proof  to  all  projective  sets.”  This 
accorded  with  Solovay’s  hopes  for  the  relative  consistency  of  ADL(R),  and  was  to 
be  borne  out  by  developments  in  the  1980’s.  In  the  other  direction,  he  saw  telling 
evidence  for  a theorem  of  ZFC: 

Martin  [70]  actually  drew  his  conclusion  from  the  existence  of  a cardinal  k 
satisfying  k — > (coOJ®.  Owing  to  the  boundedness  of  order-preserving  maps 
(cf.  13.4),  he  observed  that  if  there  is  a cardinal  k satisfying  Va  < co\{k  — > 
(a)f ®)  then  Borel  Determinacy  holds.  He  then  noted  that  the  partition  property 
relativizes  to  L (9.15),  and  although  Det(Tlj)  fails  in  L,  Borel  Determinacy  also 
relativizes:  The  assertion  that  Gt0{A)  is  determined, 

3erVy(cr  * y e A)  V 3r Vz(z  * r ^ A)  , 

is  S](a)  when  A is  A\(a),  and  hence  absolute  for  L when  a e L.  Note  that  Borel 
Determinacy  is  itself  a n\  assertion,  as  “V  Borel  A”  can  be  rendered  through  real 
codes  for  Borel  sets  (cf.  before  11.8)  as  a real  quantifier  V1. 

The  best  outright  determinacy  result  at  the  time  was  the  Davis  [64]  result 
Det(Lj),  and  its  proof  could  be  readily  carried  out  in  second-order  arithmetic. 
The  apparent  impasse  in  stalking  of  Borel  Determinacy  with  elementary  methods 
was  soon  to  be  explicated  by  a remarkable  metamathematical  analysis:  Harvey 
Friedman  was  an  early  popularizer  of  the  investigation  of  determinacy  hypotheses 
(cf.  28.16).  Formulating  games  where  the  players  must  code  models  of  fragments 
of  set  theory,  H.  Friedman  [71]  showed  in  1968  that  establishing  Borel  Determi- 
nacy would  require  the  existence  of  the  cumulative  hierarchy  through  , and  so 
in  particular  cannot  be  carried  out  in  Zermelo  set  theory,  i.e.  ZF  — Replacement. 
This  revealed  a remarkable  state  of  affairs,  since  Borel  Determinacy  is  a T^1  as- 
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sertion  about  sets  of  reals  yet  would  have  to  be  proved  using  uncountably  many 
iterations  of  the  power  set  operation.  Friedman’s  arguments  lead  to  a level-by- 
level analysis  showing  that  determinacy  at  each  new  level  of  the  Borel  hierarchy 
would  require  one  more  iteration  of  the  power  set  operation.  Fie  himself  pointed 
out  that  Det(Lj)  cannot  be  established  in  second-order  arithmetic. 

The  stakes  having  been  raised,  the  progress  was  initially  guided  by  Martin’s 
Det(nj)  proof  cast  in  terms  of  indiscemibles.  Davis  [64]  had  shown  that  deter- 
minacy is  preserved  by  the  taking  of  countable  unions  if  the  sets  involved  had 
sufficiently  simple  representations,  and  mixing  this  with  Martin’s  proof,  Baum- 
gartner had  found  an  alternate  proof  of  Det(£3).  Martin  then  saw  that  if  there  is 
a weakly  compact  cardinal,  then  there  is  such  a simple  representation  of  11®  sets 
in  terms  of  certain  indiscemibles  provided,  and  so  derived  DetfL®).  Soon  after- 
wards in  1971  Paris  [72]  procured  the  requisite  indiscemibles  in  ZFC,  establishing 
Det(E®)  outright. 

There  matters  stood  for  several  years,  and  in  retrospect  the  initiative  based 
on  indiscemibles  impeded  progress  toward  Borel  Determinacy.  Although  the  ap- 
proach of  Martin  [70]  was  to  be  seminal  for  the  investigation  of  higher  determinacy 
hypotheses,  the  large  cardinals  idea  had  to  be  cast  aside  for  another.  Blass  [75] 
and  independently  Mycielski  established  the  interesting  result  that  ADr  is  equiv- 
alent to  the  determinacy  of  all  games  of  length  &>2  with  integer  moves,  specified 

2 

in  terms  of  payoff  sets  included  in  ® &>.  The  proof  featured  moves  of  an  auxil- 
iary game  conveying  information  corresponding  to  strategies  for  the  given  game. 
Fashioning  and  elaborating  this  idea  for  the  Borel  sets  Martin  in  1974  established 
Borel  Determinacy : 

Theorem  31.3  (Martin  [75]).  Det(A[). 

Loosely  speaking,  the  proof  proceeded  by  associating  to  each  Borel  set  A 
corresponding  sets  X and  A*  C WX  such  that  A*  is  open  and  a player  has  a 
winning  strategy  in  GW(A)  exactly  when  he  has  one  in  GX(A*).  Consistent 
with  the  Friedman  anticipation,  if  A is  £®,  then  X has  roughly  the  cardinality 
of  Vffl+ff.  Of  later  emendations,  the  proof  was  reorganized  in  Martin  [85]  to 
subordinate  various  features  to  a more  directly  inductive  argument,  and  extended 
in  Martin  [90]  to  games  of  form  GX{A)  for  uncountable  X and  sets  A “Aj” 
in  this  context.  The  argument  was  further  streamlined  in  the  book  Martin  [oo], 
which  moreover  exposed  Martin’s  analysis  of  optimal  hypotheses:  He  established 
in  ZFC  — Power  Set  that  if  a < w\  and  Vm+a  exists,  then  Det(Aj+0,+3),  and 
refining  Friedman’s  arguments,  that  the  hypotheses  do  not  suffice  to  establish 
Det(E?+ff+3). 

The  establishment  of  Borel  Determinacy  was  a significant  advance  for  set 
theory  in  that  it  both  resolved  an  open  problem  whose  stakes  had  been  raised  by 
metamathematical  anticipations,  and  secured  the  Borel  world  for  the  consequences 
of  determinacy.  Among  a variety  of  applications,  it  now  followed  from  the  argu- 
ment for  Martin’s  28.4  that  any  Borel  subset  B of  “a  either  includes  or  is  disjoint 
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from  a Turing  cone,  i.e.  there  is  a e mco  such  that  {b  e au>  \ a <T  b)  c B or 
else  {b  e wa>  \ a <x  b]  fl  B — 0.  One  interesting  turn  of  events  is  that  while 
Wadge  in  his  studies  of  Wadge  degrees  of  Borel  sets  (recast  in  Louveau  [83])  had 
assumed  Borel  Determinacy  in  the  late  1960’s  in  order  to  avail  himself  of  Borel 
Wadge  Determinacy  - the  proposition  that  for  every  pair  of  Borel  sets  A,K  01  co 
the  Wadge  game  WG(A,  B)  (from  the  proof  of  Wadge’s  Lemma  29.15)  is  deter- 
mined - Louveau-Saint-Raymond  [87,  88]  later  established  that  this  proposition  is 
already  provable  in  second-order  arithmetic.  In  another  reverberating  development 
H.  Friedman  [81]  considerably  expanded  his  metamathematical  work  of  [71],  for- 
mulating in  particular  a simple  “Borel  diagonalization”  proposition  [81: 235]  which 
follows  from  Borel  Determinacy  and  also  requires  uncountably  many  iterations  of 
the  power  set  operation  but  is  moreover  n\ . 

It  is  the  direction  upward  from  Martin  [70]  to  stronger  hypotheses  and  con- 
clusions that  holds  the  sustained  interest.  The  early  efforts  at  getting  more  de- 
terminacy were  informed  by  results  showing  that  determinacy  hypotheses  beyond 
Det(n[)  require  the  consistency  of  measurability  and  more: 

The  first  break  was  made,  once  again,  by  Solovay.  Soon  after  he  showed  that 
full  AD  implies  the  measurability  of  co\  and  co 2,  he  established  that  DefiTTj)  implies 
the  existence  of  a transitive  e-model  of  ZFC  with  a measurable  cardinal.  After 
further  results  along  these  lines  by  H.  Friedman  [71a:  §3]  and  Martin,  Solovay 
then  established  that  Defi  Aj)  implies  the  existence  of  transitive  e-models  of  ZFC 
with  many  measurable  cardinals.  Subsequently  stronger  conclusions  were  derived 
by  Martin  and  others.  These  results  were  based  on  a way  that  Solovay  had  found 
for  exploiting  local  versions  of  Martin’s  result  28.4  about  Turing  degrees. 

Returning  the  compliment  Martin  in  the  meanwhile  had  gone  back  to  Solo- 
vay’s  original  method  for  establishing  the  measurability  of  a>\  from  AD.  By  that 
means,  he  was  eventually  to  derive  a series  of  results  that  exactly  correlated  the 
existence  of  many  measurable  cardinals  with  amounts  of  determinacy,  based  on 
the  difference  hierarchy. 

For  working  with  classes  closed  under  the  taking  of  countable  intersections,  if 
A is  such  a class  and  a < a>\,  the  class  a- A is  defined  by  stipulating  membership 
for  A C k(mo))  as  follows:  A is  in  a- A iff  there  is  a sequence  (Aj  | § < a)  of 
subsets  of*C"t»)  in  A such  that  Af  2 A$  for  f < $ < a and: 

w e A iff  the  least  § such  that  either  w f Aj  or  § = a is  odd  . 

(As  usual,  an  ordinal  is  odd  iff  it  is  of  the  form  S + 2 n + 1 for  some  limit  ordinal  S 
and  n e co.)  A can  be  viewed  as  a union  of  alternate  concentric  rings  of  a 3 -chain 
of  sets  in  A:  A is  in  1-A  iff  it  is  in  A;  A is  in  2-A  iff  A — Ao  — Ai  for  some 
sets  A,-  in  A;  A is  in  3-A  iff  A = (Ao  — Ai)  U A 2 for  some  sets  A,-  in  A;  A is 
in  4- A iff  A = (Ao  — Ai)  U (A2  — A3)  for  some  sets  A,-  in  A;  and  so  forth.  If 
a < f < co\,  then  a- A C f-A.  Set 

DiffCA)  = \Ja<coa-A  . 

This  cumulative  hierarchy  is  the  difference  hierarchy  for  A. 
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Hausdorff  [14:  460ff]  formulated  the  difference  hierarchy  for  closed  sets  and 
showed  that 

DiffiTT”)  = \°2  , 

using  this  “resolvability’'  to  establish  properties  of  this  class.  Kuratowski 
[58:  §37111]  generalized  this  work  through  the  Borel  hierarchy,  showing  that  for 
each 

Diff(II^)  = A°+1  . 

(The  work  of  Louveau-Saint  Raymond  [87, 88]  on  Borel  Wadge  Determinacy  is 
couched  in  these  terms.  An  analogous  characterization  of  A®  for  limit  8 is  avail- 
able.) Martin’s  analysis  was  based  on  the  a -11}  sets;  through  its  discussion  it 
should  be  kept  in  mind  that  in  contrast  to  the  Kuratowski  result  Diff(Il})  falls  far 
short  of  A*. 

The  sensible  formulation  of  lightface  versions  is  not  direct,  as  the  3 -sequence 
should  be  appropriately  definable.  For  (lightface)  a-IJj,  first  assume  that  a is 
recursive,  i.e.  there  is  a recursive  well-ordering  E C 2(mco)  with  ordertype  a, 
and  temporarily  let  ||m||  for  m e u>  denote  the  ordertype  of  the  predecessors  of 
m according  to  E.  Then  for  A C k(wco),  A is  a-77/  iff  there  is  a sequence 
(A|  | f < a)  of  subsets  of  k(u>co)  such  that  {(;«,  w)  | we  A||m||]  is  77/;  Af  2 A^ 
for  t,  < § < a;  and 

weA  iff  the  least  f such  that  either  w Aj  or  § = a is  odd  . 

For  a e 10 co,  analogously  define  the  class 

a-77/ (a)  for  a recursive  in  a . 

Then  for  any  A C k(0>co)  and  a < co\,  A is  a-Il}  iff  there  is  an  a e 03 co  such  that 
a is  recursive  in  a and  A is  a-n^(a). 

In  all  these  various  cases,  there  are  universal  sets  at  each  level,  and  so  the 
hierarchies  are  proper  (cf.  12.8). 

Martin’s  results  on  many  measurable  cardinals  went  hand-in-hand  with  a new 
elucidation  of  0#.  The  first  significant  result  about  sets  in  Diff(Ilj)  was  established 
by  Friedman  [71a],  who  extended  31.2  by  showing  that  if  0#  exists,  then  Det(3- 
n\ ).  He  also  found  a way  to  show  that  Defi/T/  ) fails  in  every  generic  extension 
of  L.  Martin  then  established  both  the  intrinsic  necessity  of  0#  and  the  relevance 
of  the  difference  hierarchy  by  establishing  the  following  characterization: 

31.4  Theorem  (Martin).  0#  exists  iff  13ct( <r,j2  /1-/7,1 ). 

(See  Martin  [oo]  or  Dubose  [90]  for  a proof.) 

In  the  wake  of  ideas  from  Jensen’s  Covering  Theorem  (see  volume  II),  Martin 
in  1975  applied  31.4  to  establish  the  converse  of  H.  Friedman’s  result:  if  Det(3- 
n\ ),  then  0#  exists.  The  gap  from  3-/7/  to  77/  loomed  larger  than  the  rest, 
but  Harrington  closed  it  shortly  afterwards  by  using  ideas  of  Sami  and  Steel  to 
establish  the  converse  of  Martin’s  31.2: 
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31.5  Theorem  (Harrington  [78]).  Suppose  that  a e "'a>  and  Det(/7j  (a)).  Then  a# 
exists. 

For  the  generality,  this  (and  31.2)  were  stated  in  a form  relativized  to  a e “<a. 
The  Martin  and  Harrington  results  together  provide  an  elegant  characterization  for 
any  a e wa>: 

a#  exists  iff  Det(/7/  (a))  . 

This  is  a remarkable  equivalence  between  the  existence  of  the  metamathematical 
sharps  and  the  determinacy  of  the  analytic  sets,  an  unexpected  confluence  that 
bolstered  both  the  large  cardinal  and  the  determinacy  theory  and  intimated  further 
riches  to  be  unearthed  about  an  underlying  unity. 

That  unity  was  considerably  reinforced  by  Martin’s  companion  results  on 
many  measurable  cardinals.  He  saw  that  starting  with  a + 1 measurable  cardinals 
he  could  get  Det((ttr-o:  + 1)-Il[),  and  in  fact  determinacy  a bit  further  along  the 
difference  hierarchy.  Continuing  the  pattern  of  31.1  and  31.2,  he  then  saw  that 
just  having  an  inner  model  with  a measurable  cardinals  and  indiscemibles  for  that 
structure  suffices,  and  moreover,  a converse  can  be  achieved! 

Suppose  that  M an  inner  model  of  ZFC.  Just  to  clarify,  say  that  M has  a 
measurable  cardinals  iff  there  is  a sequence  (pj  | £ < a)  of  ordinals  such  that  for 
each  | < a,  M \=  p%  is  measurable.  Suppose  next  that  M |=  S is  the  supremum  of 
the  measurable  cardinals.  Eschewing  details,  say  that  M has  indiscemibles  iff  it 
has  an  uncountable  set  of  ordinal  indiscemibles  whose  minimum  element  is  greater 
than  S.  In  light  of  §21,  having  such  indiscemibles  is  a notion  of  transcendence  over 
the  model,  and  an  0 : -type  theory  can  be  developed.  By  21.26  and  the  comment 
after,  for  any  a e 10 co,  a ^ exists  iff  there  is  an  inner  model  containing  a that  has  a 
measurable  cardinal  and  has  indiscemibles. 

The  following  generalizes  31.4  and  31.5,  which  together  can  be  regarded  as 
the  a — 0 case: 

31.6  Theorem  (Martin).  For  any  a e and  a recursive  in  a,  the  following  are 
equivalent: 

(a)  Det(U/j<(U2  (co2 -a  + f)-Tl\  (a)). 

(b)  Det((«2-a  + \fn\(a)). 

(c)  There  is  an  inner  model  of  ZFC  that  contains  a and  has  a measurable 
cardinals  and  has  indiscemibles. 

In  particular,  for  any  a e wa>, 

a^  exists  iff  Det((tt>2  + l)-/7/  (u))  . 

John  Simms  [79],  a student  of  Martin,  provided  an  analogous  characteriza- 
tion of  the  existence  of  a transitive  e-model  of  ZFC  satisfying  “there  are  arbitrar- 
ily large  measurable  cardinals”  which  has  indiscemibles  in  an  appropriate  sense: 
This  is  equivalent  to  the  determinacy  of  (Tl\  ),  the  class  of  countable  unions  of 
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Boolean  combinations  of  Tl\  sets.  (See  Martin  [oo:§5.5].)  Simms  in  fact  estab- 
lished equivalences  analogous  to  31.6  through  the  difference  hierarchy  for  Ej(77j) 
in  terms  of  hierarchically  high  measurable  cardinals.  Loosely  speaking,  each  new 
level  corresponds  to  a measurable  limit  of  cardinals  corresponding  to  the  previous 
level. 

These  various  determinacy  hypotheses  fall  far  short  of  Detf  Ai)  (which  by 
30.10  is  equivalent  to  Det(IL})).  An  impressive  result  at  the  time  was  due  to 
John  Green  [78],  a student  of  Harrington,  who  was  able  to  get  more  than  mea- 
surable limits  of  measurable  cardinals:  If  Det(zJ)),  then  there  is  an  inner  model 
of  ZFC  that  has  a measurable  cardinal  k with  a normal  ultrafilter  U such  that 
[a  < k | a is  measurable]  e U.  Such  a k is  high  in  the  “measurable  limits  of 
measurables”  hierarchy;  through  the  study  of  inner  models  Steel  [82]  was  able  to 
determine  just  how  much  determinacy  is  derivable  from  having  such  a k. 

There  were  later  emendations  having  to  do  with  refined  issues.  31.6  provides 
no  equivalence  for  Det(«2-/711(a))  (taking  a — 1 as  a typical  case).  Martin  showed 
in  the  late  1970’s  that  the  existence  of  an  inner  model  of  ZFC  that  contains  a and 
has  a measurable  cardinal  suffices,  and  eventually  improved  it  in  [90]  to  show  that 
if  there  is  an  inner  model  of  ZFC  that  contains  a and  has  a measurable  cardinal, 
then  every  set  of  reals  such  that  both  it  and  its  complement  are  in  (of  + 1 l-ZTj1  («) 
is  determined.  This  is  optimal  in  the  sense  that  by  31.6  indiscernibles  for  the 
inner  model  are  needed  to  get  Det {{of  + 1) -III  (a)).  Martin  also  established 
technical  equivalences  for  Det(&>2-/7j  (a))  in  terms  of  inner  models  that  have  a 
measurable  cardinal  and  closed  unbounded  sets  having  special  properties  for  such 
models.  Recently,  Philip  Welch  found  an  equivalence  for  Det(&>2-IlJ)  (boldface 
only)  more  analogous  to  31.6,  but  for  a weaker  set  theory. 

Derrick  Dubose,  a student  of  Martin,  has  explored  in  detail  how  sharps  corre- 
late with  determinacy  hypotheses  between  DetflJ^^  /J-TT,1 ) and  D Qt(co2-n\  ).  In 
terms  of  such  hypotheses  formulated  via  hierarchies  of  sets,  he  [90]  found  char- 
acterizations for  the  existence  of  the  iterated  sharps  0#,  0###,  and  so  forth.  He 

[91,92]  also  characterized  the  existence  of  the  sharp  function,  that  function  # on 
the  reals  given  by  #(a ) = a#,  and  extended  versions  of  this  function.  Moreover, 
he  [95]  characterized  the  existence  of  sharp  functions  on  objects  of  higher  type, 
sets  of  reals,  sets  of  sets  of  reals,  and  so  forth. 

Having  extensively  set  out  the  results  on  the  determinacy  of  sets  in  the  dif- 
ference hierarchy  for  11} , how  Martin  got  measurable  cardinals  out  of  such  hy- 
potheses is  illustrated  with  his  earliest  result  along  these  lines  established  in  the 
early  1970’s. 

31.7  Theorem  (Martin).  Assume  Det((or  + l)-/7j).  Then  there  is  an  inner  model 
with  a measurable  cardinal. 

Proof.  The  proof  uses  a game  which  is  a sort  of  &>-ary  iteration  of  the  one  used 
by  Solovay  in  his  original  proof  of  the  measurability  of  a>\.  The  players  choose 
integers  as  usual: 


446 


Chapter  6.  Determinacy 


I : x (0)  x (2) 

II  : x(l)  x(3) 

Using  a recursive  coding  scheme,  x/  is  regarded  as  encoding  a sequence 
<(x/)|  | § < co2)  € ®2+1(®tt>),  and  x//,  a sequence  <(x//)|  | £ < <w2}  e ® (®w). 
Recall  again  (§13)  that  each  z e ®o>  encodes  a relation  Ez  c.  co  x co  with  ||z|| 
denoting  its  ordertype  when  it  is  a well-ordering. 

Suppose  first  that  for  some  £ < co2,  either  E(xi)(  or  Eixn^  is  not  a well- 
ordering. Then  for  minimal  such  £,  if  E(Xl)%  is  not  a well-ordering,  then  II  wins. 
Otherwise,  I wins. 

Suppose  now  that  for  each  £ < co2  both  E(Xl)(  and  E(Xn)%  are  well-orderings. 
For  such  % and  n e co,  set 

P\  = ll(JC/)f||  , 
yxh  = ii(jc//)fn , 

Pn  = sup({^.„+;  | i e co}  U { Yo,.n+i  I i e «})  . and 
Sx  = sup({p^  | n e &>})  . 

Let  Fx  be  the  filter  over  <5*  generated  by  {px  \ n e co},  i.e. 

X e Fx  iff  X<Z8X  a 3m  e oj({pxn  \ m < n e co}  c X)  . 

The  intention  when  the  pf  s are  increasing  is  that  L[FX]  |=  FxnL[Fx]  is  a normal 
ultrafilter  over  Sx,  and  through  the  following  rules  the  burden  falls  on  I to  try  to 
defeat  this.  For  their  formulation,  recall  (as  before  20. 1 8)  that  each  z ewco  encodes 
a structure  Nz  = (co,  Ez,  Az),  and  when  it  is  well-founded  and  extensional,  tr(/V,) 
denotes  its  transitive  collapse  and  nz\  Nz  -»  tr(Nz)  the  collapsing  isomorphism. 

Stipulate  that  II  wins  unless  N(Xl),  is  well-founded  and  extensional,  and  for 
some  limit  ordinal  6X  > co, 

tr (AW  = (Le4Fxl  e,  F*  n Ltr[E'})  . 


Assuming  that  I has  managed  to  arrange  this,  II  wins  unless  there  is  a counterex- 
ample to  normality:  an  /:  Sx  — > Sx  such  that  f e EofEx}  and  {p  < Sx  | p < 
f(p )}  ^ Fx  yet  / is  not  constant  on  any  set  in  Fx . If  there  is  such  an  /,  taking 
f x to  be  the  least  such  in  the  canonical  well-ordering  of  F[FX],  II  wins  unless 
either 

*(0)  — 0 A fx(px)  > fx(px)  , or 

X(0)  = 1 A fx(Px)  < fx(px ) , 

in  which  case  I wins. 

To  analyze  the  payoff  set  of  this  game,  consider  for  £ < co2  the  sets 

Y 2^  = {x  e | E(X])t  is  a well-ordering}  , and 
121+1  = {x  e 01  co  | E( XlI)t  is  a well-ordering}  . 


31.  Det(a-IlJ) 


447 


There  is  also  a set  corresponding  to  the  various  conditions  for  I winning  the  game 
as  described  in  the  previous  paragraph: 

Y * = {xe  I E(xi)w2  is  well-founded  A N(Xi^2  \=  eri  a f 

A 7T(xp^ 2 d A/)(02  = F C\  7T lX{)m2  1 • 

where  a!  is  the  sentence  of  3.2(b),  and  \jr  is  a sentence  asserting  that  &>  exists  and 
that  there  is  no  largest  ordinal,  and  sets  out  the  various  conditions  for  I winning 
the  game  with  a counterexample  to  normality.  It  is  straightforward  to  check  that 
with  an  appropriate  f, 

I wins  a play  x iff  the  least  £ such  that  x £ or  f = co2  + 1 is  odd  . 

With  the  defining  clauses  for  Y other  than  well-foundedness  being  arithmetical 
as  for  13.8(a)(c)  it  is  straightforward  to  see  that  the  payoff  set  is  (co2  + 11-/7/  and 
so  the  game  is  determined  by  hypothesis.  As  in  other  Solovay  games,  I cannot 
have  a winning  strategy: 

Assume  to  the  contrary  that  a is  a winning  strategy  for  I . A bounding 
argument  is  first  applied  as  in  the  first  proof  of  28.2  to  get  a closed  unbounded 
subset  of  a>\ : For  ;/  < co\  set 

B,,  = {((<7  * y)/)f  | % < co2  A y e wcd  A 

Vf  < £(£(},)  is  a well-ordering  with  ||(y)f  ||  < p)}  . 

B,j  is  Lj  in  a real  coding  the  ordertype  ij.  Moreover,  in  terms  of  13.6,  B,t  C WF 
as  a is  winning,  and  so  Bn  C WF^  for  some  /3  < co\.  Taking  w(r] ) to  be  the  least 
such  p,  the  resulting  function  w:  a>\  -»  &>i  leads  to  a set 

C — [p  < co i | p is  an  infinite  limit  A Vrj  < p(w(p)  < p)} 
closed  unbounded  in  co\. 

C has  a crucial  property  which  will  be  used  several  times  in  what  follows: 
If  (pn  | n e co)  e 01 C,  then  there  is  a play  x e °f  game  according  to 

a such  that  p * — pn  for  every  n e cd.  To  show  this,  let  (?7j  | £ < cd2)  be  such  that 
for  each  n e cd,  {rpa-n+i  I t e cd}  c pn  and  is  cofinal  in  pn.  Then  let  y e 01  cd  be 
such  that  for  each  f < co2,  ||£(y){||  = r) |.  Then  o * y e n?<(i,2  and  it  follows 
from  the  definition  of  w and  C that  p„*y  — pn  for  every  n e cd. 

A contradiction  is  now  derived  from  the  existence  of  cr:  Fix  an  increasing 
sequence  (pn  \ n £ co)  of  limit  points  of  C such  that  C fl  pn  has  ordertype  pn  for 
each  n e cd.  Let  x e be  a play  according  to  a such  that  pxn  — pn  for  every 
n e cd.  There  is  a corresponding  Fx  and  a counterexample  fx  to  normality,  and 
.r(0)  must  be  either  0 or  1. 

Suppose  first  that  x(0)  = 0.  Fixing  k e cd  for  the  moment,  let  z e Ym 2 be  a 
play  according  to  0 such  that  p~  = pn+k  for  n e cd,  with  corresponding  Fz  and 
fz.  Evidently  Fz  — Fx,  and  so  also  fz  — fx.  But  z(0)  = x(0)  — 0,  as  .r(0)  is, 
after  all,  the  first  move  of  the  game  played  according  to  0.  Flence, 
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fx{pk)  = fz(Pi)  > fz(P\)  = fx{Pk+ 1) , 

the  inequality  as  a is  winning.  Now  this  holds  for  every  k e co,  and  so  fx(pk ) 
must  become  constant  for  sufficiently  large  k e co.  But  this  is  a contradiction,  as 
fx  cannot  be  constant  on  any  set  in  Fx . 

Suppose  next  that  x(0)  — 1.  Fixing  k e co  and  p e C with  p < pk  for  the 
moment,  let  y be  a play  according  to  a such  that  pyG  = p and  py+1  = pn+k  for 
n e co.  Then  Fy  — Fx,  fy  = fx,  and  _v(0)  — x(D)  — 1,  and  so 

fx(p)  = fy(Po ) < fy<J>\ ) = fx(Pk ) - 

Fixing  a further  ( e C with  p < t;  < pk,  a similar  argument  with  a play  z 
according  to  er  such  that  p\  = t,  and  p % = pi  for  every  n / 1 shows  that 

f(p)  < fx(0  ■ 

Varying  p and  f , it  follows  that  fx\(C C\ pk)  is  an  increasing  function  into  fx(pk). 
But  since  C fl  pk  has  ordertype  pk, 

Pk  < fx(Pk)  ■ 

Finally,  varying  k leads  to  {p  < Sx  \ p < fx(p)}  e Fx , which  is  a contradiction 
of  the  choice  of  fx. 

ITaving  established  that  I cannot  have  a winning  strategy,  it  follows  that  II 
must  have  one.  Let  r be  such  a strategy.  Following  the  argument  leading  from  a 
to  the  set  C,  r leads  to  a set  D closed  unbounded  in  co \ that  satisfies  the  following: 
If  (pn  \ n e a>)  e 10  D,  6 > co  is  a countable  limit  ordinal,  and  i < 2,  then  there 
is  a play  x e Rj «o2  °f  ^le  game  according  to  r such  that  pi  = p„  for  evety 
n e co;  9X  (is  defined  and)  equals  0;  and  .r(0)  = i. 

Fixing  an  increasing  sequence  (pn  \ n e co)  e w D,  set  8 = sup({p„  | n e co}) 
and  let  F be  the  filter  over  S generated  by  {pn  \ n e co}.  The  proof  is  completed 
by  showing  that  L[F]  \=  F fl  L[F]  is  a normal  ultrafilter.  This  follows  from  the 
following  property:  If  f e s8  fl  L[F]  with  {p  < 8 \ p < f(p)}  $ F,  then  f is 
constant  on  a set  in  F.  Granted  this  it  is  simple  to  show  first  that  F fl  L[F ] is  an 
ultrafilter  in  L[F}  using  functions  in  s2  fl  L[,F],  and  then  to  derive  normality. 

To  verify  the  property,  assume  to  the  contrary  that  there  is  a counterexample 
which  is  not  constant  on  any  set  in  F,  and  let  /*  be  the  least  such  in  the  canonical 
well-ordering  of  L[F'].  By  the  argument  for  20.2(a)  there  is  a countable  0 *, 
which  can  be  taken  to  be  a limit  ordinal,  such  that  /*  e LgfF].  Let  i*  — 0 if 
f*(po)  > f*(pi),  and  i*  = 1 if  f*(po ) < f*(pi).  Then  by  the  condition  on  D 
there  is  a play  x according  to  r such  that  pi  — pn  for  every  n e co;  9X  = 9 *;  and 
x(0)  — i*.  But  this  is  a contradiction,  since  r is  a winning  strategy.  The  proof  is 
complete.  H 

The  special  properties  exhibited  by  the  closed  unbounded  set  C in  this  proof 
lend  plausibility  to  the  optimal  conclusion  from  Det((u)2  + 1 )-n\ ),  the  existence  of 
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0 ^ . Little  more  complexity  beyond  this  proof  is  needed  to  show  that  if  Det((a>2-a  + 
D-/7/),  then  there  is  an  inner  model  that  has  a measurable  cardinals. 

Through  such  results  a pattern  of  correlations  with  large  cardinals  emerged 
during  the  mid-1970’s,  one  agreeably  reinforced  by  the  amounts  of  the  rank  hi- 
erarchy necessary  to  establish  fragments  of  Borel  Determinacy  as  the  beginning 
cases.  The  determinacy  hypotheses  involved,  however,  fell  far  short  of  De^ni) 
which  seemed  to  loom  ever  stronger  in  consistency  strength.  Through  the  1970’s 
a variety  of  substantial  propositions  were  shown  to  be  consistent  relative  to  a su- 
percompact cardinal.  But  despite  various  attempts  Detfllj)  was  not  among  them. 
Then  reaching  higher  to  the  hypotheses  10-13  at  the  limits  of  consistency  (§24) 
Martin  was  able  to  establish  a synthetic  connection  in  1978: 

31.8  Theorem  (Martin  [80]).  Assume  12.  Then  Det(Tl'). 

In  fact,  sufficient  is  the  existence  of  a j:  Vs  -<  Vs  satisfying  an  iterability 
condition,  a hypothesis  strictly  intermediate  between  12  and  13.  Martin  applied 
the  basic  idea  of  his  Det(Il[)  proof,  using  the  representation  of  IT]  sets  cast  in 
terms  of  an  embedding  j as  above  in  order  to  apply  its  24.6  properties. 

This  affirmation  of  large  cardinals  raised  hopes,  but  also  concerns.  Could 
there  be  some  sort  of  converse?  Any  such  result  would  likely  have  to  secure  an 
inner  model  of  some  large  cardinal  hypothesis  (cf.  31.7),  but  into  the  early  1980’s 
the  development  of  inner  model  theory  had  not  progressed  beyond  the  models 
of  Mitchell  [79].  These  were  for  the  “hypermeasurable”  cardinals,  essentially  y- 
strong  cardinals  for  small  y (§26),  and  the  amount  of  determinacy  in  them  was 
analyzed  by  Steel  [82].  In  any  case,  the  models  all  have  A 3 well-orderings  of  the 
reals  of  ordertype  co\,  and  so  cannot  satisfy  DehTlJ,)  (by  27.14  and  the  argument 
for  13.10). 

This  apparent  obstruction  to  getting  inner  models  for  large  cardinals  implying 
Detfllj)  was  a major  reason  advanced  for  its  considerable  strength,  and  Martin’s 
result  ruled  out  inner  models  of  the  sort  then  known  for  hypotheses  at  the  level 
of  10-13. 

The  other  question  was  where  the  higher  determinacy  hypotheses  would  fit  in 
if  at  all;  perhaps  they  veer  off  in  some  orthogonal  direction  from  large  cardinals? 
With  attention  shifted  upward  to  the  strongest  large  cardinal  hypotheses,  Woodin 
boldly  formulated  10  just  at  the  edge  of  the  Kunen  inconsistency  and  amplified 
Martin’s  embeddings  proof  to  establish  the  following  in  early  1984: 

31.9  Theorem  (Woodin).  Assume  10.  Then  ADL(R). 

So  determinacy  is  subsumed  by  the  large  cardinal  hierarchy,  but  only  just! 
However,  the  hypotheses  10-13  were  little  understood  and  might  totter  as  Kunen’ s 
proposition  did.  And  there  seemed  little  hope  that  the  inner  model  theory  could 
be  developed  enough  to  achieve  equiconsistencies.  But  at  least  a mooring  was 
secured,  and  there  was  new  anticipation  of  a grand  unification  of  large  cardinals 
and  determinacy.  How  that  came  about  is  the  subject  of  the  next  section. 
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32.  Consistency  of  AD 

As  a fitting  conclusion  to  both  this  chapter  and  volume,  this  section  provides  a 
panoramic  survey  of  the  crowning  achievements  concerning  the  relative  consis- 
tency of  determinacy  hypotheses.  After  setting  out  the  basics  of  what  came  to 
serve  as  a general  framework,  proofs  are  dispensed  with,  as  in  any  case  they 
are  precluded  by  the  range  and  sophistication  of  the  results  discussed.  Rather,  a 
scaffolding  is  provided  based  on  the  historical  progression  of  ideas,  one  that  may 
serve  as  a guide  to  expositions  to  come  (Martin  [oo]  and  Woodin-Mathias-Hauser 
[oo]). 

The  culminating  equiconsistency  results  are  due  to  Hugh  Woodin,  whose  re- 
sults have  figured  in  previous  sections.  Already  as  an  undergraduate  Woodin  in 
1976  had  reduced  a proposition  in  the  theory  of  Banach  algebras  to  a set-theoretic 
one  whose  consistency  was  then  established  by  Solovay.  Then,  as  his  student 
at  Berkeley,  Woodin  found  a more  systematic  route  to  that  consistency  incorpo- 
rating Martin’s  Axiom  (see  Dales-Woodin  [87]),  and  soon  established  important 
consistency  results  using  Radin  Forcing  and  about  ideals  over  a)\  (see  volume  II). 
With  strong  hypotheses  and  structural  issues  in  set  theory  becoming  the  driving 
imperatives  in  his  work,  Woodin  joined  the  Cabal  Seminar  in  the  early  1980’s  and 
became  deeply  involved  in  the  investigation  of  determinacy.  By  the  mid- 1 980 ’s 
his  many  splendid  results  had  earned  him  a reputation  comparable  only  to  that 
of  Shelah,  who  has  been  in  the  limelight  since  the  1970’s.  But  unlike  the  lat- 
ter Woodin  is  not  one  to  rush  to  the  presses,  and  the  bulk  of  his  work  remains 
unpublished. 

Stepping  back,  it  was  Martin  and  Steel  who  made  a major  breakthrough 
toward  the  relative  consistency  of  determinacy  from  large  cardinals,  and  for  Martin 
this  was  the  capstone  of  nearly  twenty  years  of  sustained  effort.  In  retrospect  the 
quest  for  this  synthesis  was  based  on  a remarkable  insight  that  principles  in  ZFC 
based  on  elementary  embeddings  could  somehow  provide  combinatorial  principles 
for  sets  of  reals  entailing  their  regularity  properties.  Affirming  Martin’s  early 
successes  using  tree  representations  to  establish  determinacy  a general  approach 
to  such  representations  is  first  provided;  it  is  an  elegant  abstraction  of  the  essentials 
of  previous  constructions  involving  large  cardinals. 

Homogeneous  Trees 

Homogeneous  trees  are  implicit  in  the  work  of  Martin-Solovay  [69]  (described  in 
§15)  and  in  early  work  on  determinacy:  Martin’s  Det(IlJ)  result  31.1  and  Kunen’s 
1971  analysis  of  <5j  and  <5,!.  The  concept  itself  was  isolated  much  later  by  Kechris 
[81:  63]  and  Martin,  and  came  to  be  the  means  by  which  determinacy  was  derived 
from  large  cardinals.  In  the  paradigmatic  case,  for  any  k > a>,  n|  sets  of  reals  are 
/cf-Suslin  (13.14),  i.e.  projections  of  trees  on  co  x k.  But  the  infinite  paths  through 
such  a tree  form  a closed  subset  of  “w  x wk  in  the  expected  sense  (cf.  12.10),  and 
so  are  determined  (27.1).  Martin  derived  Det(IlJ)  by  taking  k to  be  measurable 
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and  using  a normal  ultrafilter  over  zc  to  “integrate”  over  winning  strategies  for 
closed  subsets  of  wco  x ®zr  to  get  such  strategies  for  their  projections,  the  n] 
subsets  of 10 co.  Recalling  the  §15  analysis  homogeneous  trees  are  just  those  trees 
that  carry  the  requisite  ultrafilter  structure  to  push  through  Martin’s  argument. 
Some  terminology  is  first  developed  toward  the  definition: 

For  a tree  T on  some  kco  x Y (as  defined  after  12.8)  and  for  s e [Jm€co  k('"co), 

Ts  — {u  | {s,  u)  e T } . 

In  this  section  by  tree  is  meant  some  such  tree,  usually  taking  k = 1 for  notational 
simplicity,  such  that  Ts  0 for  every > such  s.  Also,  principal  ultrafilters  are 
allowed,  contravening  a convention. 

Suppose  that  for  some  set  Y and  m < n e co,  U is  an  ultrafilter  over  some 
I C mY  and  W is  an  ultrafilter  over  some  J C n Y . Then  U is  a projection  of  W 
iff  for  any  X C /, 


X e U iff  {u  e J \ u\m  e X]  e W . 

If  moreover  U and  W are  &>i -complete,  then  with 

ju:  V -<  Mv  = Ult(V,  U)  and 
jw:  V -<  Mw  = Ult(Y,  W)  , 

the  map  k : Mv  ->  Mw  given  by 

k([f]u)  = [(f(u\m)  | u e J)]w 

is  elementary,  and  k o jv  — jw- 

A sequence  (Um  \ m e co)  of  co\ -complete  ultrafilters  is  a tower  iff  for  some 
set  Y , Um  c V(mY)  for  m e co,  and  Um  is  a projection  of  Un  for  m < n e co.  For 
such  a tower,  let 

j,„:  V < Mm  = Ult(V,  Um)  , 
and 

jmn  - Mm  y Mn 

be  the  embeddings  corresponding  to  Um  being  a projection  of  Un,  so  that  jmn o jm  — 
jn.  Then 

((Mm  I m e co),  {jmn  I m < 77 » 

is  a directed  system. 

([/„,  | 777  e co)  is  countably  complete  iff  whenever  Xm  e Um  for  m e co, 
there  is  a d e mY  such  that  of|77z  e Xm  for  each  such  m.  By  the  argument  for 
15.7,  a tower  is  countably  complete  iff  the  corresponding  directed  system  has  a 
well-founded  direct  limit. 

To  proceed  to  the  main  definitions,  for  a tree  T on  some  kco  x Y and  k > co, 
taking  k — 1 for  notational  simplicity,  T is  k -homogeneous  iff  for  s e <01u>  there 
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are  a: -complete  ultrafilters  Us  over  Ts  such  that  Us  is  a projection  of  Uj  whenever 
s c s,  and:  For  any  x e wa>, 

if  x e p[T ],  then  (Ux \m  \ m e co)  is  a countably  complete  tower  . 

The  converse  is  immediate,  since  if  X,„  — Tx \m  for  m e to,  a corresponding  d e MY 
such  that  d\m  e Xm  for  such  m verifies  that  x e p\T\. 

For  A C k(wco),  A is  k -homogeneously  Suslin  iff  A — p[T]  for  some  k- 
homogeneous  tree  T.  A is  homogeneously  Suslin  iff  A is  k -homogeneously  Suslin 
for  some  k > co.  Note  that  for  such  sets  to  exist  in  any  substantive  sense,  there 
must  be  a measurable  cardinal:  Otherwise  the  Uf  s would  have  to  be  principal 
(2.2),  and  so  being  homogeneously  Suslin  would  be  equivalent  to  being  closed 
(cf.  12.10).  Also,  note  that  by  the  argument  for  2.8,  it  can  be  assumed  that  k > 2K° 
and  much  more. 

As  anticipated,  the  following  two  exercises  factor  Martin’s  Det(Tl[)  result 

31.1  through  the  concept  of  homogeneous  tree. 

32.1  Exercise.  Suppose  that  there  is  a measurable  cardinal  k.  Then  every  11}  set 
is  k -homogeneously  Suslin. 

Hint.  Show,  following  the  proof  of  3 1 . 1 , that  the  Us  ’s  defined  there  are  as  required. 

H 


32.2  Exercise.  Suppose  that  a set  of  reals  is  homogeneously  Suslin.  Then  it  is 
determined. 

Hint.  Suppose  that  A — p[T]  say,  where  T is  a k -homogeneous  tree  on  cox  Y with 
corresponding  k -complete  ultrafilters  Us  for  s e <l°co.  To  establish  that  GW(A)  is 
determined,  consider  the  auxiliary  game  G*  as  in  the  proof  of  3 1 . 1 : 

/ : (*«)),  s(0)>  <*(2),s(l)> 

II:  x(l)  x(3) 

I and  II  both  play  integers  x(i)  to  form  a real  x,  but  in  addition,  I plays  g(i)  € Y 
to  form  a function  g:  co  —>  Y.  I wins  if  (x,  g)  e [T],  and  otherwise  II  wins. 

As  before,  G*  is  determined,  and  if  I has  a winning  strategy  in  this  game, 
he  has  one  in  GW(A).  It  remains  show  that  if  II  has  a winning  strategy  r in  G* , 
then  he  has  one  strategy  in  Gffl(A): 

Proceeding  as  for  31.1,  for  n e co,  t e 2"+la>,  and  u e "+1T  let  r(r,  u ) denote 
the  strategic  response  according  to  r to 

«f(0),  m(0)>,  r(l),  . . . , {t(2n),  u(n)))  . 

Then  for  such  t and  k e w,  set 

Za  = { u e 7f|(n+i)  | r(t , u)  = k}  . 

Next,  define  a strategy  tq:  {_)n€012n+x(r>  — > co  by: 
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r o(t)  — k iff  Zt'k  e U, |(„+D  . 


This  is  well-defined  by  the  u>\ -completeness  of  the  Uf  s.  Finally,  argue  that  to  is 
a winning  strategy  for  II  in  Gm(A): 

Assume  to  the  contrary  that  x is  a play  according  to  r0,  yet  x e A.  By  the 
homogeneity  of  T there  is  a d e 10  Y such  that  for  every  n e a>,  d\(n  + 1)  e 
Zx|(2H+i),To(.Y|(2n+i))-  But  then,  (x,  d)  is  in  [T]  and  corresponds  to  a play  of  G*  in 
which  II  plays  according  to  r and  still  loses.  Contradiction!  H 

Weakly  Homogeneous  Trees 

Weak  homogeneity  for  trees  is  a technical  weakening  of  homogeneity  also  consid- 
ered by  Kechris  [81:  66]  and  Martin,  which  through  the  work  of  Woodin  became 
pivotal  as  the  first  structural  property  related  to  determinacy  to  be  obtained  from 
large  cardinals.  For  a tree  T on  some  ka>  x Y and  k > a>,  again  taking  k = 1 for 
simplicity,  T is  k -weakly  homogeneous  iff  for  s,  t e <0}a>  with  |s|  = |f|  there  are 
a: -complete  ultrafilters  USJ  over  Ts  such  that  USJ  is  a projection  of  Uj  -t  whenever 
s\m  — s and  t\m  — t for  some  m e w,  and:  For  any  x:  e 

if  x e p[T],  then  there  is  a y e “a  such  that 

(Ux\m,y\m  I € co)  is  a countably  complete  tower  . 

As  with  homogeneous  trees,  the  converse  is  immediate.  Clearly,  a k -homogeneous 
tree  is  k -weakly  homogeneous. 

For  A C k(w(d),  A is  K-weakly  homogeneously  Suslin  iff  A — p[T\  for  some 
a: -weakly  homogeneous  tree  T,  and  A is  weakly  homogeneously  Suslin  iff  A is 
A' -weakly  homogeneously  Suslin  for  some  a:  > co.  As  with  homogeneity  it  can  be 
assumed  that  k > 2K°  and  much  more. 

The  following  two  exercises  cast  weak  homogeneity  in  different  lights. 

32.3  Exercise.  A C k<f(D)  is  K-weakly  homogeneously  Suslin  iff  A — pB  for 
some  k -homogeneously  Suslin  B C i+1(a>®). 

Hint.  Taking  k = 1,  suppose  in  the  forward  direction  that  A = p[T ] where  T is  a 
a: -weakly  homogeneous  tree  on®x7  with  corresponding  a: -complete  ultrafilters 
USJ  for  x,  t e <a>u>  with  |x|  = |r|.  For  any  x,  y e wu>,  if  {Ux |m,y|m  | m e a>)  is  not 
countably  complete,  let  X?„y  e Ux \m,y\m  for  m e u>  be  such  that  no  d e WY  satisfies 
d\m  e X',,;'  for  every  such  w;  otherwise,  set  Xxmy  — Tx \m  for  every  m e w.  Next, 
for  s,t  e <mu>  with  |s|  = \t\  set 

ZSJ  = niK\  I x,y  e^co  A x||^|  = ^ a y\\t\  — t}  . 

Then  ZSJ  e USyt  since  USy,  is  (2K°)+ -complete  (by  the  argument  for  2.8,  as  noted 
before).  Finally,  define  a tree  T+  on  2o>  x Y by 


T+  = {(x,  t,  u)  | u e ZSytj  , 
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and  set  U SJ  — USJ  fl  V(ZSJ)  for  s,te  <aco  with  |s|  = |f|.  Now  verify  that  T+ 
with  the  Us/s  is  k -homogeneous,  and  that  with  B = p[T+],  A — pB. 

The  converse  is  straightforward.  H 

If  homogeneously  Suslin  sets  are  to  be  viewed  as  generalizations  of  closed 
sets,  the  verifying  ultrafilters  ensuring  determinacy,  then  weakly  homogeneously 
Suslin  sets  can  be  regarded  through  32.3  as  generalizations  of  analytic  sets.  It 
now  follows  from  32.1  that  if  there  is  a measurable  cardinal  k,  then  every  £,  set 
is  K-weakly  homogeneously  Suslin. 

32.4  Exercise.  Suppose  that  T is  a tree  on  some  kco  x Y and  k > co.  Then 
T is  K-weakly  homogeneous  iff  there  is  a countable  collection  V of  K-complete 
ultrafilters  such  that  for  any  x e k{u’co):  If  x e p\T\  then  there  is  a countably 
complete  tower  {Um  \ m e co)  e such  that  Tx \m  e Um  for  every > m e co. 

Hint.  Taking  k — 1 , for  the  direction  starting  from  V it  can  be  assumed  that  V is 
closed  under  projections:  If  W e V and  W c V(nY)  for  some  n e co  and  set  Y, 
then  for  each  m < n 


{{u\m  | u e Z}  | Z e W]  e V . 

Proceed  now  by  induction  on  |j|  to  define  USJ  so  that  for  any  i e co,  allowing 
principal  ultrafilters  if  necessary, 

I e e tt>}  2 

{W  fl  V(Ts~yf  | W e V A Ts~(i)  e W A Us  t is  a projection  of  W}  . 


32.3  shows  how  weak  homogeneity  is  closely  tied  to  homogeneity,  but  32.4 
suggests  that  the  former  may  be  considerably  easier  to  obtain.  One  consequence 
is  that  being  weakly  homogeneously  Suslin  is  preserved  under  Wadge  reducibility, 
i.e.  the  taking  of  continuous  pre-images: 

32.5  Exercise.  Suppose  that  B C 1 (fco)  is  K-weakly  homogeneously  Suslin, 
/:  k(wco)  -*  l(a>co)  is  continuous,  and  A = f~x(B).  Then  A is  K-weakly  ho- 
mogeneously Suslin. 

Hint.  Taking  k — l — 1,  suppose  that  B — p[T\  where  T is  a tree  on  co  x Y,  and 
let  A = p[5]  where  S is  the  tree  on  co  x Y defined  as  in  the  proof  of  13.13(g) 
from  T and  a function  6 . Show  that  if  V verifies  the  weak  homogeneity  of  T as 
in  32.4,  then  it  also  does  so  for  S: 

Suppose  that  x e p[S].  Then  f(x)  e p[T],  so  that  there  is  a countably 
complete  tower  (Um  \ m e co)  e MT>  such  that  Tf(x)\n  e Un  for  every  n e co.  To 
complete  the  proof,  check  that  Sx\m  e Um  for  every  m e co: 
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S, \,„  = [u  e’"Y  | (x\m,  u)  e 5} 

= {«  g "!F  | (0(x\m),  m| |6>(jc|w)|)  g T} 

= {uemY  | u\n  e Tm\n) 

where  n = \6(x\m)\,  so  that  6(x\m)  = f(x)\n.  But  since  Tf(x)\n  e Un  and  Um 
projects  to  Un,  it  follows  that  Sx \m  e Um.  ~\ 

A crucial  feature  of  weakly  homogeneously  Suslin  sets  is  that  their  com- 
plements are  also  Suslin  by  a canonical  “dualization”  process.  This  was  already 
developed  in  §15,  and  with  the  motivation  given  there  we  can  proceed  forthwith 
to  the  formulation: 

Suppose  that  T is  a a: -weakly  homogeneous  tree  on  coxY  with  corresponding 
a: -complete  ultrafilters  Us  t for  s,  t e <0>co  with  |s|  = \t |.  For  s,  t,s,t  G <ma>  with 
s — s\m  and  t = t\m  for  some  m e co,  let 

js.r-  v -<  MSit  = Ult(V,  USJ) 

and 

j(s,t),(s,l)-  Ms,t  -<  M-j 

be  the  embeddings  corresponding  to  Us  t being  a projection  of  U-t,  so  that 
j(s,t),(s,t)  ° Js.t  — h,r  terms  of  the  fixed  recursive  enumeration  (s,-  | i e co) 
of  <a>co,  define  a tree  onax  (2|i'l)+  by: 

f = l(s,  (Si  | i < |5|)>  e U x m((2|F|)+)  | 

Vf,  j < M(S,'  D S j — > Sj  < 7(j||sJ.|,sJ),(i||s;|,s,)  (^/))1  • 

The  first  part  of  the  following  is  a representation  that  recasts  15.5  and  15.7 
in  terms  of  countably  complete  towers;  the  second  shows  that  that  representation 
is  quite  robust. 

32.6  Proposition. 

(a)  p[f  ] = wu>  - p[T], 

(b)  For  any  p.o.  P with  \ P\  < a c,  |b/>  p[T]  = wco  — p\T], 

Proof,  (a)  Suppose  first  that  x e p[T],  say  with  (. x,g ) e [f].  For  i e co,  let 
hr-  Tx ||S(|  On  be  such  that  g(i)  = [/;,]  in  MV||S;|,S;.  Then  for  s;  D Sj, 

'Zjj  = [u  E Tx 1 1 s,  I hj(u ) < hj  (u | \$j  |)}  G Ux 1 1 s,| , s,  • 

Assume  now  to  the  contrary  that  x € p[T],  Then  there  would  be  a y G a>co 
such  that  (Ux\m,y\m  \ m G co)  is  countably  complete,  and  so  a d G mY  such 
that  for  any  s(-  c y,  d||s,|  G fj ( Yl}  \ s ,•  D s, } . Flence,  when  y D s,-  D Sj, 
hf  (d\ |s/ 1)  < hj(d\\Sj\).  This  leads  to  an  infinite  descending  sequence  of  ordinals 
and  thus  a contradiction. 

For  the  converse,  suppose  that  x G wco  — p[T ].  Since  Tx  is  well-founded,  let 
p:  Tx  ->  ||  Tx  ||  be  its  rank  function.  For  i G co  set: 
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si  - tPirC||Sl.|]£/,M,|„i  . 


Then  for  s(-  D Sy, 


iu  ^ rx||Sj|  | pill)  < p(w||Sy|)}  € t^c||s,'|,Si 

so  that  Sj  < 7(s||Sj-|,sj),(s|i|si|,Si)(^')-  To  finish  off  an  inconsequential  case,  if  Y is 
finite,  then  the  Us/s  are  all  principal,  and  so  p[T ] = “&>.  If  Y is  infinite,  then 
note  that  for  i e co, 


l«.-|  < I {/  I /:  Tx | ->  ||PJ}|  < 2m  . 

Hence,  all  the  conditions  being  satisfied,  .r  e p[T]. 

(b)  Suppose  that  P is  a p.o.  with  |P|  < k and  G is  P-generic.  It  must 
be  shown  that  in  V[G],  p[f]  — mco  — p[T ].  By  the  proof  of  10.15,  each  Ust 
generates  a a: -complete  ultrafilter  Uft  over  Ts  in  V[G],  For  s,  t,s,t  € <<u&>  with 
s — s\m  and  t = J\m  for  some  m e co,  let 

itr  V[G]  < M+  = Ult(V[G],  U+)  , 

and  as  (7+  is  readily  seen  to  be  a projection  of  U-~,  let 


be  the  corresponding  embedding,  so  that  - o = j+..  Then: 

(i)  If  f:Ts-+V  is  in  V[G],  there  is  a g:  Ts  — > V in  V such  that 
[/][/+  = [#][/+  • 

(ii)  For  any  /:  Ts-r  V in  V,  [f]v+  = [f]Ust  . 

(“)  ^<t,«>,<J,r>|0n  = J{s,t),(s,t)\°n  ■ 

(i)  follows  from  the  claim  (*)  of  the  proof  of  10.15.  (ii)  follows  from  (i)  by 
induction  on  rank.  Finally,  for  (iii),  note  first  that  for  any  ordinal  §,  % — [g]v+ 
for  some  g:  Ts  —>  V in  V by  (i).  But  then, 

= l(g(u \m)  | u e Tj)]v+ 

= [{g(u\m)  | u e Tj) ]v..  (by  (ii)) 

j(s,t),{s,t)  ([<?]t/$,f ) 

= <?,?>([«]£/+)  (aSain  by  (b)) 


confirming  (iii). 

Proceeding  to  the  argument  that  p[T\  = ">w  — p[T]  in  V[G],  note  first  that 
p[T]  n p[f]  = 0 there:  Define  a tree  S merging  T and  f by 


5 = {(^,  u , (Sj  | i < |jr|»  | {s,  u)  e T A (s,  (Sj  \ i < M»  e T}  . 
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Then  in  V as  well  as  in  V[G],  /;>[£]  = p[T]  fi  p[f  ] so  that  p[T]  n p[f]  — 0 iff 
S is  well-founded.  But  then,  S is  well-founded  in  V,  and  hence  by  absoluteness 
(0.3)  in  V[G]  as  well,  and  the  result  follows. 

It  remains  to  show  that  in  V [G]  if  x e mco  — p\T~\,  then  x e p[T]:  Arguing  in 
V[G]  as  for  (a),  since  Tx  is  well-founded,  let  p:  Tx  -»  || Tx ||  be  its  rank  function, 
and  for  i e co  set 

Si  = [Pl7’x||Sf|]c/+|i.|i<.  • 

For  s,-  D s j,  Si  < 7(t||Sj.|,S/K(s||Si|,Si)(^)  as  before,  so  that  by  (iii), 

Si  < j{s\\sj\,Sj),{s\\si\,Si)(Sj)-  Hence,  {x,  ( <5*  \ i e co))  e [?],  and  so  x e p[f],  H 

Soon  after  their  formulation,  it  was  seen  that  weakly  homogeneously  Suslin 
sets  possess  the  regularity  properties.  The  first  arguments  grew  out  of  the  gener- 
ative game  context,  and  Woodin  found  new  forcing  proofs  indicative  of  the  new 
abstract  approach.  Versions  of  both  are  given  in  the  following  proof. 

32.7  Theorem.  Suppose  that  a set  of  reals  is  weakly  homogeneously  Suslin.  Then 
it  is  Lebesgue  measurable,  has  the  Baire  property,  and  has  the  perfect  set  property. 

Proof.  Suppose  that  A c wa>  is  /r -weakly  homogeneously  Suslin.  The  perfect  set 
property  for  A is  first  established  in  terms  of  games: 

Consider  the  game  G5 (^“A)  from  21.6  for  the  perfect  set  property.  By 
having  I not  play  finite  sequences  but  rather  integers  coding  them  the  game  can 
be  recast  as  one  of  form  GW(B).  Some  straightforward  checking  shows  that  B can 
be  taken  to  be  a continuous  pre-image  of  A,  and  thus  is  k -weakly  homogeneously 
Suslin  by  32.5. 

By  32.3  there  is  a C c ) k -homogeneously  Suslin  with  B — pC.  As 
in  the  proof  of  27.14,  consider  the  unfolded  version  of  the  game  GW(B ) where 
I makes  extra  choices  y(i)  leading  to  a real  y,  and  I wins  if  (x,  y)  e C,  and 
otherwise  II  wins.  By  a simple  modification  of  the  argument  for  32.2,  this 
unfolded  game  is  determined.  But  Gm(B)  is  essentially  G\{'P“A),  and  so  by  the 
argument  for  27.14  A has  the  perfect  set  property. 

Lebesgue  measurability  and  having  the  Baire  property  can  be  established 
similarly,  although  more  checking  has  to  done  to  be  able  to  apply  32.5.  Instead, 
Woodin’ s forcing  argument  is  given,  which  builds  on  Solovay’s  for  11.11: 

Let  I be  a k -weakly  homogeneous  tree  such  that  A = p[T ],  and  T the 
canonical  tree  for  the  complement  so  that  32.6  is  satisfied.  To  show  that  A is 
Lebesgue  measurable  proceed  as  for  11.11  in  terms  of  random  reals,  the  corre- 
sponding p.o.  £>*,  and  closed  and  Gs  codes: 

With  ||-  temporarily  denoting  a set  restriction  of  the  forcing  relation  for  B* 
sufficient  for  the  coming  argument,  let  p be  a regular  cardinal  so  that  contains 
T,  T,B*,  and  ||-  , and  let  /V  «<  if  be  countable  and  containing  these  sets.  In 
what  follows,  assume  the  straightforward  liberalization  of  the  necessary  forcing 
concepts  to  such  models  of  just  ZFCV.  Note  that 


458 


Chapter  6.  Determinacy 


{x  e “&>  | x is  not  random  over  N} 

is  null,  being  equal  by  11.10  to 

|J{AC  | c e N is  a G$  code  for  a null  set}  , 

a countable  union  of  null  sets  since  N is  countable.  Hence,  it  suffices  to  find  a 
Borel  set  X such  that  A AX  consists  of  reals  not  random  over  N: 

For  B*  forcing  in  the  sense  of  N with  r a canonical  name  for  the  random 
real,  let  Q e N be  a maximal  antichain  consisting  of  closed  sets  each  deciding 
r e p[T\,  all  in  the  sense  of  N.  Set 

X = l){Ac\  A"  &Q  A ANc\[i-  ep[T]). 

Since  Q is  countable  by  the  <wi-c.c.,  X is  Borel  (in  fact  Fa).  To  complete  the 
proof  it  suffices  to  show  that  for  x random  over  N, 

x e A iff  x e X . 

First,  if  x is  random  over  N and  x e X,  then  /V[v]  \=  x e p[T\  by  the  definition 
of  X,  and  hence  x e p[T\  — 4 . For  the  converse,  the  crucial  property  32.6(b) 
is  used:  Since  B*  has  a dense  subset  of  cardinality  2K°  (the  closed  non-null  sets) 
and  k > 2k°  can  always  be  assumed, 

lb-  P[T]  =“co-  p[f \ . 

It  follows  from  this  and  the  definition  of  X that  if  x is  random  over  N and  x £ X, 
then  2V[jc]  |=  x e /?[f],  and  hence  x e p[T]  =°)a>  — A. 

The  argument  for  the  Baire  property  is  analogous,  using  Cohen  reals.  H 

The  Ramsey  property  was  discussed  at  the  ends  of  §§11,27.  An  analogous 
forcing  argument  using  Mathias  reals  shows  that  being  Ramsey  can  be  added  to  the 
conclusions  of  32. 7.  And  as  for  Lebesgue  measure  and  Baire  category  there  is  a 
more  involved  game  argument.  This  depends  on  Ramseyness  being  equivalent  to 
having  the  Baire  property  in  a topology  (as  mentioned  in  §27),  and  the  determinacy 
of  the  unfolded  version  of  the  corresponding  **-game  as  in  the  proof  of  27.14  (see 
Kechris  [95:  19D.21D]). 

The  forcing  approach  depended  on  the  32.6  property  of  both  a set  of  reals  and 
its  complement  being  Suslin  with  representations  that  are  robust  through  forcing 
extensions.  In  Feng-Magidor-Woodin  [92],  this  property  is  called  “universally 
Baire”  and  studied  for  its  own  sake  as  well  as  in  connection  with  determinacy. 

Having  set  the  stage  with  the  basic  theory  of  homogeneous  and  weakly  ho- 
mogeneous trees,  we  begin  the  historical  ascent.  Martin  made  the  first  connection 
between  determinacy  and  weak  homogeneity  by  showing  in  ZF  that  ADr  implies 
that  eveiy  tree  is  coi-weakly  homogeneous.  Inspired  by  this,  Woodin  by  1982  had 
established  the  first  result  in  the  general  framework  of  the  subsequent  results: 
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32.8  Theorem  (Woodin  [86]).  Suppose  that  k is  a supercompact  cardinal  and  G 
is  Col(&>,  K)-generic.  Then  L(M)'dGl  |=  Every  tree  is  k -weakly  homogeneous. 

The  appeal  to  L(R)V[C1  is  as  for  Solovay’s  11.1.  Woodin  used  ultrafilters 
given  by  supercompactness  to  verify  weak  homogeneity,  but  in  any  case  32.8  is 
a relative  consistency  result.  To  describe  the  progress  toward  actual  implications 
from  large  cardinals,  the  situation  is  first  summarized  in  the  investigation  of  strong 
hypotheses  in  the  early  1980’s,  a subject  that  is  taken  up  at  length  in  volume  II. 

The  Role  of  Supercompact  Cardinals 

After  the  introduction  of  the  strong  large  cardinal  hypotheses  (as  described  in 
Chapter  5)  the  work  through  the  1970’s  led  to  a growing  conviction  that  these  hy- 
potheses provide  an  ultimate  scale  against  which  all  possible  consistency  strengths 
can  be  gauged.  The  stronger  propositions  were  usually  found  to  be  consistent  rel- 
ative to  supercompact  cardinals,  and  also  to  imply  the  consistency  of  having  many 
measurable  cardinals  in  a strong  sense.  There  was  one  early  and  notable  excep- 
tion, however,  the  proposition  that  there  is  an  ^-saturated  ideal  over  w\.  Kunen 
[78]  in  1972  had  established  the  consistency  of  this  proposition  relative  to  a huge 
cardinal,  and  even  gave  a heuristic  argument  for  why  something  like  it  may  be 
necessary. 

Kunen  collapsed  a huge  cardinal  to  &>i  in  such  a way  that  a residue  of  a huge 
embedding  is  retained  as  an  ^-saturated  ideal  over  co\.  Not  only  are  such  ideals 
of  general  interest  as  a combinatorial  feature  low  in  the  cumulative  hierarchy  but 
Solovay  [71]  had  shown  that  they  have  a intriguing  technical  property:  Starting 
with  such  an  ideal  I there  is  a forcing  extension  V[G]  (in  fact  via  the  p.o.  V{w\)  — 
I)  in  which  there  is  a generic  elementary  embedding,  a j:  V -<  M with  critical 
point  (x>\.  Kunen  observed  that  C M,  so  that  j is  like  an  embedding 

witnessing  the  almost  hugeness  of  co\ . But  the  embedding  has  domain  just  the 
ground  model  V,  and  only  exists  in  the  generic  extension  V[G],  Still,  this  was  a 
heuristic  argument  taken  to  suggest  that  saturated  ideals  might  have  consistency 
strength  on  the  order  of  hugeness,  that  the  generic  embedding  might  foreshadow 
a full-fledged  embedding. 

Kunen’s  work  emboldened  set  theorists  to  use  strong  large  cardinal  hypothe- 
ses as  in  the  formative  work  of  Foreman  [82,83],  and  generated  a continuing 
interest  in  ideals  with  strong  properties  low  in  the  cumulative  hierarchy.  In  fact, 
this  interest  was  not  only  to  trigger  the  major  advances  toward  the  consistency  of 
determinacy,  but  to  continue  to  interweave  ideals  into  the  unifying  theory  as  one 
of  a triad:  large  cardinals,  determinacy,  and  ideals. 

Two  synthetic  results  in  the  late  1970’s  injected  a sense  of  anticipation  as 
well  as  mystery  into  the  growing  mosaic.  One  was  the  Martin  result  31.8  that 
12  implies  Defillj),  and  the  other  was  the  result  of  Steel- Van  Wesep  [82]  that  in 
ZF  if  AD®  and  & is  regular,  then  in  a forcing  extension  there  is  an  inner  model 
of  ZFC  in  which  NS,U1  is  a>2-saturated.  The  previous  work  of  Martin  and  others 
had  suggested  that  Detjllj)  is  an  apparently  very  strong  hypothesis,  but  the  upper 
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bound  established  by  Martin  raised  the  specter  of  having  to  reach  up  to  the  top  of 
the  large  cardinal  hierarchy  to  pull  in  determinacy.  The  Steel- Van  Wesep  result 
established  the  consistency  of  a basic  proposition  about  stationary  subsets  of  an 
that  had  not  been  derived  from  large  cardinals;  recalling  the  ideal  terminology 
from  §16,  Kunen’s  ideal  was  not  even  of  form  NS^JS  for  any  stationary  S c an. 
This  raised  the  prospect  of  having  to  enlist  determinacy  hypotheses  in  order  to 
secure  the  consistency  of  strong  combinatorial  propositions  in  ZFC. 

Both  the  Martin  and  Steel- Van  Wesep  results  were  improved  by  Woodin. 
Around  1981  he  reduced  the  hypothesis  of  the  latter  result  to  just  AD  (see  his 
[83a]),  and  in  early  1984  he  improved  the  conclusion  of  Martin’s  result  to  ADi(R), 
albeit  starting  from  the  stronger  10  (31.9).  This  was  an  important  advance  in  that 
it  did  subsume  AD  into  the  large  cardinal  hierarchy,  but  any  possibility  of  an 
equiconsistency  result  remained  remote.  It  turned  out  that,  as  sometimes  happens 
in  mathematics,  a fresh  infusion  of  ideas  from  a different  quarter  was  to  lead  to  a 
major  reorientation  and  generate  new  results. 

The  break  came  in  the  work  of  three  fine  mathematicians  at  the  forefront  in 
the  investigation  of  large  cardinals,  Foreman,  Magidor,  and  Shelah.  In  a major 
collaboration  in  early  1984  in  Jerusalem  they  [88,88a]  implemented  a project 
of  larger  significance  for  set  theory  which  led  to  a new  understanding  of  strong 
propositions  and  the  possibilities  with  forcing.  Applying  the  powerful  semi-proper 
forcing  techniques  of  Shelah  [82],  they  [88]  collapsed  a supercompact  cardinal  k to 
a> 2 via  a k- c.c.  forcing  so  that  in  the  extension  a provably  maximal  form  of  Martin’s 
Axiom,  Martin  ’s  Maximum,  holds.  They  then  established  the  relative  consistency 
of  several  propositions  by  deriving  them  directly  from  this  new  axiom.  One  such 
proposition  was  that  NSffil  is  «2 -saturated.  Hence,  not  only  was  the  upper  bound 
for  the  consistency  strength  of  having  an  ^-saturated  ideal  over  a>\  considerably 
reduced  from  Kunen’s  huge  cardinal,  but  for  the  first  time  the  consistency  of 
NSWl  itself  being  ^-saturated  was  established  relative  to  large  cardinals  rather 
than  determinacy. 

With  their  work  Foreman,  Magidor,  and  Shelah  had  overturned  a long-held 
view  about  the  scaling  down  of  large  cardinal  properties.  In  the  first  flush  of 
new  hypotheses  and  propositions,  Kunen  had  naturally  enough  collapsed  a large 
cardinal  to  a>\  in  order  to  transmute  strong  properties  of  the  cardinal  into  an  a>2- 
saturated  ideal  over  an,  and  this  sort  of  direct  connection  had  become  the  rule. 
The  new  discovery  was  that  a careful  collapse  of  a large  cardinal  to  a> 2 instead 
can  provide  enough  structure  to  secure  such  an  ideal.  In  particular,  the  generic 
elementary  embedding  given  by  the  further  forcing  with  the  saturated  ideal  does 
not  extend  any  previous  embedding  associated  with  the  original  large  cardinal. 

And  now  for  a crucial  connection  that  was  made  with  Lebesgue  measurability. 
As  described  in  Foreman-Magidor-Shelah  [88:  §3],  results  of  Shelah  and  Woodin 
when  put  together  led  to  a direct  implication  about  L(R)  from  supercompactness: 

Martin’s  Maximum  implies  that  2K°  = K2.  Shelah  realized  that  the  tech- 
nique of  S-closed  forcing  from  his  [82]  can  be  incorporated  into  the  argument 
for  Martin’s  Maximum  to  get  a version  of  that  axiom  consistent  with  CH:  The 
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supercompact  cardinal  k is  collapsed  by  an  amended  k- c.c.  forcing  which  more- 
over is  &>i-Baire  and  so  does  not  add  any  new  reals.  Starting  in  the  ground  model 
V with  an  S c coi  so  that  both  S and  co\  — S are  stationary,  this  S-closed  forcing 
arranges  in  any  generic  extension  L[G]  that  NS^Koji  — S)  is  ^-saturated. 

In  long-distance  telephone  conversation  with  Foreman  in  April  1984,  Woodin 
outlined  a proof  that  with  the  reals  unchanging  in  such  a collapse,  the  existence  of 
a generic  elementary  embedding  of  the  type  given  by  an  ^-saturated  ideal  over 
&>i  in  a further  extension  of  V[G]  implies  that  in  the  ground  model  V every  set  of 
reals  in  L(R)  is  Lebesgue  measurable.  This  depended  on  the  Solovay  result  11.1 
and  the  known  fact  that  for  large  cardinals  k,  the  reals  in  any  generic  collapse  of 
k to  ol> i via  a k- c.c.  forcing  coincide  with  the  reals  of  a Col(&>,  k)  Levy  collapse. 
(Notably,  arguments  using  a generic  elementary  embedding  to  secure  Lebesgue 
measurability  in  the  ground  model  had  occurred  in  previous  work  of  Magidor 
[80]  and  Foreman  [86].)  The  Shelah  realization  and  the  Woodin  observation 
established:  If  there  is  a supercompact  cardinal,  then  every  set  of  reals  in  L(R)  is 
Lebesgue  measurable. 

Such  an  implication  from  a large  cardinal  hypothesis  precludes  the  possibility 
of  any  inner  model  of  it  with  a reasonable  well-ordering  of  the  reals  (cf.  the 
remarks  after  31.8).  With  the  familiar  supercompact  cardinals  exhibiting  such 
“wildness”  the  quest  was  on  to  reduce  the  large  cardinal  hypothesis  sufficient  for 
the  implication.  This  was  later  described  in  Shelah- Woodin  [90],  the  concepts 
involved  having  been  explored  in  §26:  Woodin  first  noticed  that  a superstrong 
cardinal  suffices.  While  Woodin  was  visiting  Jerusalem  in  June  1984,  Shelah 
isolated  a weaker  concept  incorporating  functional  conditions,  later  called  a Shelah 
cardinal,  and  implemented  his  crucial  idea  of  investing  one  such  cardinal  to  get 
Lebesgue  measurability  for  each  level  of  the  projective  hierarchy.  But  such  a 
cardinal  still  reflected  the  requisite  technical  properties  downward,  and  so  less 
would  do.  Soon  afterward  Woodin  formulated  the  crucial  concept  of  Woodin 
cardinal  by  inverting  quantifiers  in  Shelah’s  definition,  and  showed  that  it  suffices 
for  Shelah’s  scheme: 

32.9  Theorem  (Shelah- Woodin  [90]). 

(a)  If  n € a>  and  there  are  n Woodin  cardinals  with  a measurable  cardinal 
above  them,  then  every  £*+2  set  of  reals  is  Lebesgue  measurable. 

(b)  If  there  are  infinitely  many  Woodin  cardinals  with  a measurable  cardinal 
above  them,  then  every  set  of  reals  in  L( R)  is  Lebesgue  measurable. 

The  n — 0 case  of  (a)  is  Solovay’s  14.3.  Woodin  cardinals  seemed  to  encap- 
sulate just  what  is  needed  to  carry  out  the  collapsing  argument  that  led  to  saturated 
ideals,  and  moreover  the  proofs  suggested  optimality  - recall  that  Woodin  cardinals 
are  not  necessarily  weakly  compact  so  do  not  directly  reflect  substantial  proper- 
ties. The  measurable  cardinal  hovering  above  was  to  be  a recurring  theme,  the 
purpose  being  loosely  speaking  to  maintain  a stable  environment  with  the  exis- 
tence of  sharps.  Since  Woodinness  in  consistency  strength  is  much  stronger  than 
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measurability,  it  is  a mild  if  annoying  peculiarity  of  the  theory.  Finally,  32.9  was 
the  first  instance  of  a circumstance  remarkable  granting  the  optimality  the  hypothe- 
ses: (a)  leads  from  infinitely  many  Woodin  cardinals  to  the  Lebesgue  measurability 
of  every  projective  set  of  reals,  yet  the  marginally  stronger  hypothesis  of  (b)  sud- 
denly opens  the  floodgates  to  the  measurability  of  every  set  of  reals  in  L(R). 

As  part  of  this  development  Shelah  in  early  1985  reduced  the  consistency 
strength  for  saturated  ideals  to  a Woodin  cardinal: 

32.10  Theorem  (Shelah).  Suppose  that  k is  Woodin.  Then  there  is  a forcing 
extension  in  which  k — u> 2 and  NS^  is  a>2-saturated. 

As  with  versions  of  Martin’s  Maximum  consistent  with  CH,  Shelah  was  also  able 
to  get  CH  to  hold  in  the  extension  at  the  cost  of  the  ideal  being  of  the  less  canonical 
form  NSaJS.  See  Shelah  [87]  for  related  results. 

Woodin  made  an  important  conceptual  move  in  the  Autumn  of  1984  that 
streamlined  much  of  the  previous  work.  That  work  had  turned  on  the  process  of 
collapsing  a large  cardinal  to  get  a saturated  ideal  and  then  a generic  elementary 
embedding.  As  in  any  case  there  is  a forcing  that  does  all  this,  Woodin  stalked  for 
the  essence  and  formulated  his  stationary  tower  forcing.  This  notion  of  forcing, 
given  sufficient  hypotheses,  directly  provides  a generic  elementary  embedding 
j:  V -<  M with  crit(j)  = w\  and  “Jl  C M among  a multitude  of  possibilities.  It 
looks  strange  at  first,  but  can  be  seen  as  an  outgrowth  of  the  work  of  Foreman- 
Magidor-Shelah  [88],  both  in  the  adoption  its  generalization  of  stationariness  and 
the  use  of  a key  “semi-propemess”  property  needed  to  “capture”  dense  open  sets. 
With  stationary  tower  forcing,  Woodin  eliminated  saturated  ideal  arguments  from 
the  derivation  of  the  regularity  properties  for  sets  of  reals. 

In  early  1985  Woodin  [88]  applied  his  stationary  tower  forcing  both  to  im- 
prove his  earlier  32.8  to  a direct  implication  and  to  replace  Lebesgue  measurability 
by  a prior  representability  for  sets  of  reals:  If  there  is  a supercompact  cardinal, 
then  eveiy  set  of  reals  in  L(R)  is  weakly  homogeneously  Suslin.  With  Woodin 
cardinals  soon  seen  to  provide  just  the  right  technical  strength  to  implement  sta- 
tionary tower  forcing  Woodin  then  weakened  the  hypothesis  to  an  accumulation 
of  many  Woodin  cardinals.  He  did  not  get  to  the  optimal  result,  however,  before 
a major  advance  was  made  from  a different  quarter. 

Woodin  Cardinals  and  Determinacy 

With  the  hypotheses  dramatically  reduced  for  getting  the  Lebesgue  measurability 
of  the  projective  sets,  the  growing  web  of  connections  set  the  stage  for  Martin 
and  Steel’s  breakthrough  in  September  1985.  The  new  Woodin  cardinals  held  out 
the  hope  for  getting  results  along  the  lines  of  Martin’s  correlations  of  Det(a-IlJ) 
with  inner  models  having  many  measurable  cardinals.  Alive  to  the  possibility 
Martin  and  Steel  were  able  to  derive  determinacy  by  converting  an  obstacle  in 
the  developing  inner  model  theory  into  an  asset.  They  wrote  in  their  systematic 
exposition  [89:  74]: 
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. . . the  authors  owe  a great  debt  to  the  work  of  Foreman,  Magidor,  Shelah,  and  Woodin. 
Nevertheless,  the  technical  ideas  in  our  proof  have  no  relation  to  that  work.  We  arrived 
at  our  proof  by  asking  what  it  is  about  Woodin  cardinals  that  makes  their  inner  models 
so  complicated.  In  earlier  work  on  inner  model  theory,  there  had  arisen  the  worry  that 
superstrong  cardinals  might  generate  complicated  “iteration”  trees  ....  It  turns  out  that 
Woodin  cardinals  generate  such  trees,  and,  while  this  is  a problem  for  inner  model  theory, 
it  can  be  used  to  prove  determinacy. 

Iteration  trees  was  the  means  through  which  Martin  and  Steel  established  their  re- 
sults, and  Woodin  cardinals,  complementing  their  role  in  stationary  tower  forcing, 
provided  just  the  right  technical  strength  to  construct  such  trees.  Woodin  cardinals 
thus  became  firmly  established  as  the  central  large  cardinal  concept  in  the  study 
of  the  consistency  of  determinacy. 

Martin  and  Steel’s  main  result  was  a “transfer”  theorem  that  can  be  moti- 
vated as  a means  of  propagating  determinacy  through  the  projective  hierarchy  via 
homogeneous  trees,  just  as  reduction  was  propagated  via  prewellorderings  and 
uniformization  via  scales.  If  there  is  a measurable  cardinal  k,  then  the  Ilj  sets 
are  k -homogeneously  Suslin  and  so  are  determined  (32.1,32.2),  and  £2  sets  are 
a: -weakly  homogeneously  Suslin  (32.3).  A means  must  now  be  devised,  as  for  the 
other  regularity  properties,  to  continue  to  the  next  level.  To  a weakly  homoge- 
neous tree  T corresponds  a tree  7 so  that  p[T]  = ma>  — p[T]  (32.6).  The  transfer 
theorem  used  a Woodin  cardinal  and  the  aforementioned  iteration  trees  to  derive 
homogeneity  for  T,  and  thus  determinacy  for  p[T],  from  the  weak  homogeneity 
of  T: 

32.11  Theorem  (Martin-Steel  [88,  89]).  Suppose  that  X is  a Woodin  cardinal  and 
T is  a X+ -weakly  homogeneous  tree.  Then  for  any  y < X,  T is  y -homogeneous. 

Hence,  continuing  from  £2  being  k -weakly  homogeneously  Suslin,  if  there 
is  a Woodin  cardinal  Xq  < k,  then  the  Il2  sets  are  //-homogeneously  Suslin  for 
any  //  < X0  and  £3  sets  are  //-weakly  homogeneously  Suslin  for  such  //.  And  if 
there  is  a Woodin  cardinal  Xi  < ko,  the  propagation  can  continue: 

32.12  Corollary.  If  n e co  and  there  are  n Woodin  cardinals  with  a measurable 
cardinal  above  them,  then  Det(TT('  + l ). 

The  form  of  this  result  had  been  suggested  by  the  reduction  32.9(a),  remembering 
that  Defill,'^)  implies  that  every  L('+2  set  of  reals  is  Lebesgue  measurable  (27.14). 
By  July  1986,  Martin  and  Steel  (see  their  [94])  had  advanced  the  inner  model 
theory  enough  to  get  such  a model  with  n Woodin  cardinals  and  a X'(J+2-good 
well-ordering  of  the  reals.  Such  a well-ordering  is  not  Lebesgue  measurable 
(cf.  13.10),  and  so  in  32.12  (and  32.9(a))  the  measurable  cardinal  hovering  above 
cannot  be  eliminated. 

Within  weeks  after  the  Martin-Steel  result  32.12  was  achieved,  Woodin  in  the 
autumn  of  1985  had  established  the  optimal  result  for  getting  weakly  homogeneous 
representations  in  L(R): 
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32.13  Theorem  (Woodin).  Suppose  that  there  are  infinitely  many  Woodin  cardinals 
with  a measurable  cardinal  above  them.  Then  for  any  X less  than  the  supremum 
of  those  Woodin  cardinals,  eveiy  set  of  reals  in  L(R)  is  X-weakly  homogeneously 
Suslin. 

The  following  is  an  immediate  consequence  of  32.11  and  32.13,  and  as  with 
the  latter  it  is  surprising  in  light  of  the  optimality  of  32.12  that  just  having  one 
measurable  cardinal  above  should  encompass  all  sets  of  reals  in  L(R). 

32.14  Corollary.  If  there  are  infinitely  many  Woodin  cardinals  with  a measurable 
cardinal  above  them,  then  ADL<Rh 

Although  the  pace  of  narrative  is  slowed,  a component  of  32.13  is  estab- 
lished to  convey  something  of  how  large  cardinals  provide  weakly  homogeneous 
representations  for  sets  of  reals: 

32.15  Proposition  (Woodin).  Suppose  that  k is  inaccessible  and  there  is  a generic 
extension  V[G]  in  which:  k = a)\  and  there  is  a j : V < M with  crit(_/)  = 
and  WM  C M.  Suppose  also  that  A C and  there  are  trees  T and  T'  such  that 
A — p[T\  and  p\T ']  = ma>  — p[T],  and  this  also  holds  in  V[G\  Suppose  finally 
that  X < k is  T -strong.  Then  A is  X-weakly  homogeneously  Suslin. 

Remarks.  The  existence  of  such  a generic  extension  follows  via  stationary  tower 
forcing  when  k is  Woodin.  The  existence  of  such  tree  representations  for  an 
arbitrary  set  A of  reals  and  its  complement,  robust  through  generic  extensions, 
follows  from  the  existence  of  infinitely  many  Woodin  cardinals  with  a measurable 
cardinal  above  them.  Finally,  after  a contextual  argument  showing  that  for  present 
purposes  T c VK  can  be  assumed,  the  existence  of  arbitrarily  large  X < k such 
that  X is  T -strong  follows  from  26.14  when  k is  Woodin.  In  particular,  this  result 
is  a weak  converse  to  32.6  in  the  presence  of  a Woodin  cardinal. 

Proof.  It  is  first  deduced  that  because  X is  T-strong,  A = p[T\  = p[T  fl  Vx], 
and  the  expected  argument  is  used  to  generate  ultrafilters  that  motivate  the  main 
construction: 

Suppose  that  x e A,  say  with  (x,  f)  e [T],  Let  y — rank(/)  and  e\  V -<  N 
be  such  that  crit(e)  = X,  y < e(X),  Vy+i  c N,  and  T fl  Vy+\  — e(T ) fl  VY+\.  It 
follows  readily  that  (x,  f)  e [e(T)~\,  and  so  by  elementarily  there  is  some  g £ Vx 
such  that  (x,  g)  e [T].  But  then,  x e p[T  fl  Vj.]- 

Continuing  with  the  (x,  f)  e [T]  and  e as  above,  for  m e a>  define  Um  by: 

X e Um  iff  X c vk  A f\m  e e(X)  . 

Then  Um  is  a k-complete  (possibly  principal)  ultrafilter  over  Vx.  The  definition 
recalls  the  formulation  of  extenders  from  elementary  embeddings  (§26),  except 
that  the  function  f\m  is  used  rather  than  the  set  ran(/|m).  As  for  extenders, 
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{ Um  | m e co)  is  a countably  complete  tower:  That  it  is  a tower  follows  from  the 
definition  of  the  s.  That  it  is  countably  complete  can  be  verified  directly  from 

elementarily,  since  if  Xm  e Um  for  m e co,  then  as  f\m  e e(Xm)  for  such  m and 
f e N,  there  must  be  some  d e V such  that  d\m  e Xm  for  such  m.  Finally,  note 
that  for  m e co, 

(T  n Vk)x\m  e Um  iff  f\m  e e{Tx iff  f\m  e e(T)x jm  iff  {x\m,  f\m)  e e(T ) , 

but  this  last  holds,  as  noted  before. 

Setting 

£ = {U  | U is  a A-complete  ultrafilter  over  V>_}  , 

the  above  argument  shows  that  £ verifies  the  A-weak  homogeneity  of  T IT  Vj,  in  the 
sense  of  the  characterization  32.4,  except  that  £ is  not  countable.  It  is  to  remedy 
this  defect  that  the  other  hypotheses  are  used  in  a further  reflection  argument. 
Note  first  the  £ is  countable  in  V[G]  since  k is  inaccessible  in  V and  k = co\  in 
V\G\.  Since  M c M,  it  readily  follows  that  j“£  e M and  is  countable  there. 
Building  on  this  it  will  be  argued  that  there  is  enough  cohesion  between  V and 
M to  establish  the  claim,  with  that  32.4  sense  of  “verifies”,  that 

M \=  j“£  verifies  the  j (A)-weak  homogeneity  of  j(T  fl  V>j  . 

Once  this  is  established  the  proof  would  be  complete,  since  by  elementarily  there 
would  be  some  witness  to  the  A-weak  homogeneity  of  T n V\,  and  A — p[T  n V-,  ]. 

To  establish  the  claim,  suppose  that  M \=  x e p[j  (T  n Vj.)].  It  must  be 
shown  that  in  M there  is  a countably  complete  tower  (Wm  \ m e a>)  e mj“£  such 
that  j(T  n Vf)x\ m e Wm  for  every  m e co.  First  note  that  since  p[T']  = wco  — p[T] 
in  V,  M |=  x p\j(T')\  by  elementarily.  It  follows  from  this  that  x e p[T~\  in 
V\G\. 

Arguing  in  V[G],  assume  to  the  contrary  that  x £ p[T\.  Then  since 
p[T']  = wco~ p[T]  persists  there  by  hypothesis,  x e p[T'],  say  with  (x,  g)  e [T'~\. 
It  follows  readily  that  (. x,j“g ) e [j(T')]  where  j“g  is  of  course  the  func- 
tion {{n,  j (g(n)))  | n e co},  and  moreover  j“g  e M as  mM  c M.  But  then, 
M \=  x e p[j{T')],  which  is  a contradiction. 

With  x e p[T]  in  V[G ] confirmed,  let  / e V[G]  be  such  that  (x,  f)  e [T] 
and  set  rank (/)  = y.  In  V,  let  e:  V < N be  as  at  the  beginning  of  this  proof, 
with  crit(e)  = X,  y < e(X),  Vy+ 1 C N,  and  T n Vy+l  = e{T)  n Vy+l.  Although 
/ is  not  necessarily  in  V,  for  each  m e a>  the  finite  set  f\m  can  be  used  in  V as 
before  to  define  an  ultrafilter  Um  e £. 

In  V[G],  f is  available  and  so  also  is  the  sequence  {Um  \ m e co), 
and  as  argued  previously  this  sequence  is  a countably  complete  tower  such 
that  ( T n Vf)x\ m ^ Um  for  each  m e co.  It  follows  from  mM  c M that 
j“f  = {{n,  j (/(«))}  | n e co}  e M and  ( j(Um ) | m e co)  e M,  and  from  elemen- 
tarily that  this  sequence  is  a tower  drawn  from  j“£  such  that  j(T n V\)x\m  e j ( U„ , ) 
for  each  m e co.  To  complete  the  proof,  it  is  checked  that  this  tower  is  countably 
complete  in  M: 
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In  V sete  = e \ V)  + \ ; then  in  the  definition  of  the  Um ’s,  e can  be  replaced  by  e. 
In  its  terms  the  previous  argument  for  the  U„,  ’s  can  be  transferred  from  V : Suppose 
that  in  M,  Xm  e j(Um)  for  m e co,  so  that  by  elementarity  j ( f\m ) e j(e)(Xm)  for 
such  m.  Since  j“f  e M,  and  by  elementarity  rank (7“/)  < j (y)  and  V^Y)+X  c 
ran(  j (<?)),  it  follows  that  j“f  e ran  (j(e)),  and  of  course  (j“f)\m  — for 

m e co.  But  then,  the  elementarity  of  j (e)  implies  that  there  must  be  some  d e M 
such  that  d\m  e Xm  for  every  m e co.  This  completes  the  proof  of  the  claim  about 
j“£  in  M,  and  with  it,  the  theorem.  H 

This  argument  highlights  the  crucial  roles  played  in  the  theory  of  the  property 
10 M C M and  of  p[T']  = OJco  — p[T]  holding  in  V[G ] as  well  as  in  V. 

With  the  consistency  strength  of  AD  gauged  by  32.14,  Woodin  soon  estab- 
lished the  crowning  equiconsistency  result: 

32.16  Theorem  (Woodin).  The  following  are  equiconsistent: 

(a)  ZFC  + There  are  infinitely  many  Woodin  cardinals. 

(b)  ZF  + AD. 

Whether  or  not  this  had  to  be  the  theorem,  the  scant  remarks  made  here  can 
convey  little  of  the  depth  and  sophistication  of  the  proof  (for  which  and  others  see 
Woodin-Mathias-Hauser  [00]).  In  the  forward  direction,  infinitely  many  Woodin 
cardinals  are  all  made  countable  and  a sequence  (G„  | n e co)  of  generics  for 
their  stationary  tower  forcings  is  developed  that  cohere:  In  each  V[Gn\  there  is 
a generic  elementary  embedding  jn:  V -<  Mn,  and  for  n < n there  are  maps 
jn  jj\  Mn  M-p;  that  commute  with  the  j„’s.  Taking  to  be  the  reals  of  M„,  the 
desired  model  of  AD  is  LdJ;iR„).  The  final  argument  is  one  by  contradiction 
based  on  a constructively  least  counterexample.  For  that  allegedly  undetermined 
set  A of  reals  a robust  tree  representation  for  it  and  its  complement  is  found,  and 
from  the  consequent  weak  homogeneity  a contradiction  is  reached  by  applying  the 
Martin-Steel  result  32.11  to  derive  the  determinacy  of  A. 

For  the  converse  direction,  it  is  first  established  that  if  AD  and  V — L(R), 
then  © is  Woodin  in  HOD:  HOD  is  of  course  the  class  of  hereditarily  ordinal  defin- 
able sets.  AD  is  applied  most  directly  in  an  argument,  recalling  Solovay’s  original 
proof  of  the  measurability  of  oj\ , to  get  certain  ultrafilters.  A crucial  normality 
property  is  then  derived  for  these  ultrafilters  through  the  definability  properties 
available  in  HOD.  Finally,  a uniform  version  of  the  Moschovakis  Coding  Lemma 
is  applied  to  show  that  these  ultrafilters  provide  embeddings  of  HOD  that  witness 
the  strongness  properties  below  0 needed  to  affirm  its  Woodinness. 

To  get  an  inner  model  with  infinitely  many  Woodin  cardinals,  an  infinite  3- 
decreasing  chain  ( M ,■  i € of  of  inner  models  is  first  constructed  based  on  the 
above  HOD  result.  Starting  with  Mo  = HODi<R),  M\  is  (the  transitive  collapse 
of)  the  ultrapower  of  Mo  constructed  in  L(W)  using  an  analogue  of  the  Martin 
ultrafilter  over  the  Turing  degrees.  The  ultrapower  indirectly  generates  a new  set 
of  reals  Ri  such  that  Ri  3 R,  and  an  embedding:  L(  R)  -<  L(Ri).  By  elementarity 


32.  Consistency  of  AD 


467 


AD  holds  in  L(Ri)  and  M\  = HOD1®'1,  and  the  construction  is  iterated  to  get  all 
the  Mi’s.  Recasting  the  HODi<R)  (=  0 is  Woodin  result,  an  increasing  sequence 
{Oi  i £ co)  of  ordinals  is  then  found  with  accompanying  “witnesses”  W,  C 0,- 
defined  uniformly  so  that  9j  is  Woodin  in  M,[W,].  With  W — (Wj  | i e to) 
and  M — P|,-  the  argument  is  concluded  by  showing  that  in  M[W\  each  9,  is 
Woodin. 

As  described  in  §31,  Defillj)  had  been  a focal  hypothesis  for  the  consistency 
study  of  determinacy  through  the  1970’s.  In  1989  Woodin  settled  matters  with 
the  following  elegant  characterizations: 

32.17  Theorem  (Woodin).  The  following  are  equiconsistent: 

(a)  ZFC  + Det (77] ). 

(b)  ZFC  + There  is  a Woodin  cardinal. 

32.18  Theorem  (Woodin).  The  following  are  equivalent: 

(a)  DefinJ,). 

(b)  For  any  a e <°co  there  is  a countable  ordinal  8 such  that  8 is  Woodin  in 
an  inner  model  of  ZFC  containing  a. 

The  (a)  to  (b)  direction  figured  in  the  proof  of  32.16,  and  begins  from  a 
surprising  result  of  Kechris-Solovay  [85]  in  ZF  + DC,  that  for  any  a e co,  L[a]\= 
Det(Ai)  implies  Det(OD).  That  is,  in  such  models  determinacy  for  lightface  A\ 
sets  already  implies  determinacy  for  the  largest  class  of  lightface  “definable”sets, 
the  ordinal  definable  sets. 

It  is  known  that  there  is  no  direct  analogue  of  32.17  for  Det(/7,J)  when  n > 2 
in  terms  of  merely  having  a number  of  Woodin  cardinals.  Woodin  has  explored 
this  terrain,  and  has  tentative  large  cardinal  formulations  for  equiconsistencies. 

What  about  ADR?  By  our  surface  remarks  at  the  end  of  §27  no  inner  model 
of  ADR  can  be  of  form  L(RU{5})  for  any  Kl.  Woodin  was  able  to  show  that 
if  there  is  an  inner  model  of  ADR  containing  every  real,  then  there  is  a smallest 
such  model  (see  Steel  [88:  §3]).  Moreover,  he  was  able  to  get  one  from  a moderate 
augmentation  of  having  infinitely  many  Woodin  cardinals: 

32.19  Theorem  (Woodin).  Suppose  that  there  is  a 8 such  that  cf(<$)  — co,  8 is 
a limit  of  Woodin  cardinals,  and  8 is  also  a limit  of  cardinals  y -strong  for  every 
y < 8.  Then  there  is  a forcing  extension  in  which  there  is  an  inner  model  containing 
every  real  and  satisfying  ADR. 

The  proof  is  along  the  lines  of  that  for  the  forward  direction  of  32.16,  and 
recent  results  suggest  that  the  hypothesis  may  be  the  exact  consistency  strength 
for  ADr.  As  mentioned  earlier,  ZF  + ADR  + r6>  is  regular1  is  a strong  hypothesis 
that  implies  the  consistency  of  ZF  + ADR  (Solovay  [78a]),  and  served  as  the  first 
sufficient  hypothesis  for  getting  the  relative  consistency  of  NSWl  is  ^-saturated 
(Steel- Van  Wesep  [82]).  There  were  various  indications  that  ADR  + is  regular1 
has  stronger  consistency  strength  than  any  accumulation  of  Woodin  cardinals,  but 
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in  1992  Woodin  was  able  to  tame  even  this  hypothesis.  Recall  from  §24  the 
hypothesis  II  that  for  some  <5,  there  is  a j:  V^+i  -<  V&+\,  and  that  a marginally 
stronger  hypothesis  just  short  of  Kunen’s  inconsistency  figured  in  the  first  proof 
of  the  relative  consistency  of  AD  (31.9). 

32.20  Theorem  ( Woodin).  Suppose  that  there  is  a j : V&+\  -<  Vs+ifor  some  8,  say 
with  crit(y)  = k.  Then  the  following  holds  in  any  generic  extension  via  Col(o),  k): 
There  is  an  inner  model  containing  every  real  and  satisfying  ADr  + ^0  is  regular1 . 

Large  cardinals  have  reasserted  themselves  once  again,  but  surely  the  hypoth- 
esis here  is  not  optimal. 

Among  a growing  number  of  synthetic  results  that  have  built  a large  web  of 
connections  around  determinacy,  the  more  accessible  are  cursorily  described  to 
bring  this  overview  to  a close. 

Determinacy  in  Terms  of  Regularity  Properties 

In  his  early  investigations  toward  equivalences  for  determinacy  (cf.  30.28)  Woodin 
considered  the  possibility  that  determinacy  hypotheses  might  be  equivalent  to  some 
of  their  structural  consequences  for  sets  of  reals.  He  posed  the  following  question 
for  the  projective  sets,  since  answered  in  the  negative  as  dicussed  below. 

32.21  Question  (ZF  + DC)(Woodin  [82]).  Are  the  following  equivalent: 

(a)  PD. 

(b)  Every  projective  set  of  reals  is  Lebesgue  measurable  and  has  the  Baire 
property,  and  every  projective  subset  of  2(coa> ) can  be  uniformized  by  a projective 
set. 


Lebesgue  measurability  and  having  the  Baire  property,  being  antithetical  to 
choice  principles,  are  complemented  by  uniformizability  as  just  such  a principle, 
and  these  consequences  of  PD  are  together  to  provide  the  strength  to  recover  PD. 
If  so,  a hypothesis  with  ad  hoc  features  would  gain  considerably  in  justification 
and  explication  through  being  equated  with  consequences  of  structural  immediacy. 
Although  Woodin  [82]  did  not  answer  his  question,  he  was  able  to  derive  from 
(b)  that  Vfl  e aco(a'1  exists)  («^  was  defined  at  the  end  of  §21). 

With  the  resolution  of  32.21  the  ultimate  goal,  Hauser  [95]  in  the  wake  of 
considerable  advances  in  inner  model  theory  established  that  projective  absolute- 
ness, the  proposition  that  projective  relations  are  absolute  for  V in  any  generic 
extension  (cf.  15.6),  is  equiconsistent  with  the  existence  of  infinitely  many  strong 
cardinals. 

Then  somewhat  unexpectedly,  Steel  in  the  Fall  of  1997  answered  32.21  in 
the  negative.  With  the  relevance  of  strong  cardinals  seen  and  building  on  work  of 
Woodin,  Steel  established  that  the  consistency  of  32.21(b)  already  follows  from 
the  consistency  of  the  rather  weak  proposition  (*):  There  is  an  increasing  sequence 
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{k„  | n e w)  of  cardinals  with  k their  supremum  such  that:  For  any  new  and 
x e V-,+ 1 there  is  a j:  V -<  M with  crit(j)  = k„  and  x e M. 

Flauser-Schindler  [00]  then  established  a relative  consistency  result  that  was 
just  short  of  being  the  converse  to  Steel’s,  using  inner  model  apparatus  that  neces- 
sitated the  existence  of  a measurable  cardinal  above.  Finally,  resolving  the  issue 
with  an  exact  large  cardinal  analysis  Schindler  [02]  established  the  equiconsistency 
of  32.21(b)  with  (*). 

In  the  early  1990’s  Woodin  was  able  to  answer  the  question  analogous  to 

32.21  for  full  AD  in  L(R);  recall  the  30.25  delimitation  at  772  for  uniformization 
there. 

32.22  Theorem  (ZF  + DC)(Woodin).  Assume  V — L(R).  Then  the  following  are 
equivalent: 

(a)  AD. 

(b)  Every  set  of  reals  is  Lebesgue  measurable  and  has  the  Baire  property,  and 
every  E2  subset  of  2(cow)  can  be  uniformized. 

The  new  (b)  to  (a)  direction  provided  by  Woodin  used  recent  results  of  Steel 
in  inner  model  theory.  The  import  of  32.21  for  PD  carries  over  to  32.22  for  AD, 
and  this  result,  characterizing  the  regularity  properties  for  the  large  class  of  sets 
in  L(R),  can  be  viewed  as  a culmination  of  the  efforts  of  the  classical  descriptive 
set  theorists. 

What  about  ADr  in  terms  of  regularity  properties?  Formulations  along  these 
lines  stirred  interest  as  a possible  approach  to  comprehending  this  strong  hypothe- 
sis. ADr  implies  that  every  subset  of  2(“«)  can  be  uniformized  (27.15).  Consider 
the  following  hypothesis  intermediate  between  AD  and  ADr: 

1 /2 

(ADR  ) Every  game  where  one  player  plays  reals 

and  the  other  plays  integers  is  determined  . 

Clearly  this  hypothesis  suffices  for  the  proof  of  27.15  (since  player  I can  choose 
a real  x,  and  forgetting  about  the  rest  of  his  moves  player  II  can  be  required 
to  choose  integers  forming  a real  y such  that  (x,  y)  e A).  In  the  early  1980’s 
Kechris  [88]  established  the  converse  in  ZF  + DC:  AD-J/2  iff  AD  + r Every  subset 
of  z(‘°w)  can  be  uniformized1.  With  uniformizability  seemingly  a weak  choice 
principle,  this  result  suggested  that  ADK'  is  a weak  augmentation  of  AD. 

On  the  other  hand,  Woodin  showed  that  in  ZF  + DC,  AD  + Even,'  set  of 
reals  has  a scale1  implies  ADr.  Scales  provide  tree  representations  (30.2),  and 
these  in  turn  lead  to  representations  as  y-Borel  sets  (30.13(a))  and  analogues  of 
the  projective  ordinals  (cf.  30.12).  AD  + r Every  set  of  reals  has  a scale1  thus 
has  consequences  beyond  what  was  known  to  follow  from  AD-  : (i)  Every  set  of 
reals  is  oo-Borel,  i.e.  in  the  smallest  collection  of  sets  of  reals  containing  the  open 
sets  and  closed  under  complementation  and  the  taking  of  well-ordered  unions  (of 
arbitrary  length),  (ii)  There  are  arbitrarily  large  regular  successor  cardinals  below 
0 (those  analogues  of  the  projective  ordinals). 
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Having  scales  seemed  to  strengthen  AD  considerably  whereas  uniformizabil- 
ity  seemed  a weak  augmentation.  However,  using  results  of  Becker  [85],  Woodin 
in  the  early  1980’s  established  an  equiconsistency:  Con(ZF  + DC  + AD  + Ev- 
ery subset  of  2('"io)  can  be  uniformized)  iff  Con(ZF  + DC  + AD  + Every  set 
of  reals  has  a scale).  This  foreshadowed  equivalence,  and  in  the  wake  of  the 
developments  of  the  late  1980’s  Woodin  duly  established  this,  bringing  together 
the  various  hypotheses: 

32.23  Theorem  (ZF  + DC)(Woodin).  The  following  are  equivalent: 

(a)  ADr. 

(b)  AD  + Every  set  of  reals  has  a scale. 

(c)  AD  + Every  subset  of  2("o>)  can  be  uniformized. 

It  is  striking  that  ADr  could  be  equivalent  to  such  an  ostensibly  weak  form 
as  (c). 

Determinacy  in  Terms  of  Ideals 

With  the  investigation  of  ideals  having  led  to  the  major  break  for  determinacy, 
subsequent  developments  were  eventually  to  reverberate  back  to  inform  on  ideals. 
Mitchell  and  Steel  has  pursued  the  development  of  inner  models  for  Woodin 
cardinals,  and  the  latter  was  able  to  establish  that  Shelah’s  32.10  is  essentially 
optimal: 

32.24  Theorem  (Steel).  Suppose  that  there  is  an  a>2-saturated  ideal  over  <x>\  and 
there  is  a measurable  cardinal.  Then  there  is  an  inner  model  of  ZFC  in  which 
there  is  a Woodin  cardinal. 

Again  there  is  the  minor  annoyance  of  the  measurable  cardinal,  but  an 
equiconsistency  result  can  at  least  be  stated  for  saturated  ideals  and  Woodin  car- 
dinals by  incorporating  a measurable  cardinal  hovering  over  each. 

The  inner  model  theory  has  been  further  advanced  with  persistence  and  en- 
ergy. In  1992  Steel  used  an  inner  model  with  infinitely  many  Woodin  cardinals 
to  answer  a question  of  Woodin  [88:  6591]  with  the  following  result:  If  every  set 
of  reals  in  L(M.)  is  weakly  homogeneously  Suslin,  then  ADL<R). 

Sprinting  to  the  finish,  a pivotal  ideal  concept  is  quickly  introduced.  Building 
on  our  terminology  from  §16,  an  ideal  I over  a cardinal  k is  X-dense  iff  there  is 
a family  D c V(k)  — I with  \D\  = X such  that  for  any  X e V(k)  — I,  there  is 
a Y e D with  Y — X e I . This  is  a natural  notion  of  density  for  the  Boolean 
algebra  V{k)/ I . Clearly,  if  I is  X-dense  and  X < p,  then  I is  p- saturated . As 
part  of  his  sustained  interest  in  ideals,  Woodin  in  the  early  1980’s  had  improved 
the  Steel-Van  Wesep  [82]  result  mentioned  earlier  by  showing  in  ZF  that  if  ADr 
and  © is  regular,  then  in  a forcing  extension  there  is  an  inner  model  of  ZFC 
in  which  NSMl|.S  is  co\-dense  for  some  stationary  S c a>\.  He  later  drew  the 
same  conclusion  from  the  existence  of  an  almost  huge  cardinal.  Shelah  [86:  247ff] 
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showed  that  ZFC  imposes  an  intriguing  limitation:  If  2K°  < 2Kl,  then  NSWl  is  not 
an-dense.  In  particular,  CH  implies  the  conclusion.  With  the  continuum  being  a> 2 
in  32.10,  an  important  open  question  is  whether  CH  is  consistent  with  NS®,  being 
a>2-  saturated. 

Pushing  the  connections  between  ideals  and  determinacy,  Woodin  established 
the  following,  which  like  32.22  used  Steel’s  recent  work  in  inner  model  theory. 

32.25  Theorem  (Woodin).  Suppose  that  there  is  an  a> \ -dense  ideal  over  <x>\.  Then 
ADL(R). 

Woodin  [99]  would  go  on  to  develop  a vast  and  brillant  body  of  work  on 
canonical  models  based  on  forcing  and  large  cardinals  for  the  failure  of  the  Con- 
tinuum Hypothesis,  confronting  the  fundamental  problem  from  the  beginnings  of 
set  theory.  Pivoting  toward  this  new  context  Woodin  in  late  1992  established  the 
following  converse  to  32.25  for  consistency  strength: 

32.26  Theorem  (Woodin  [99:  366]).  Assume  ADL(R\  Then  there  is  a generic 
extension  of  L(R)  satisfying  ZFC  + rNSWl  is  u>\ -dense1. 

Putting  in  the  AD  result  and  stepping  back  we  conclude  this  book  with  a 
breath-taking  synthesis  of  large  cardinals,  determinacy,  and  infinite  combinatorics, 
one  that  speaks  to  the  great  achievements  that  have  been  made,  but  is  only  a 
prologue  to  Woodin  [99], 

32.27  Theorem  (Woodin).  The  following  are  equiconsistent: 

(a)  ZF  + AD. 

(b)  ZFC  + There  are  infinitely  many  Woodin  cardinals. 

(c)  ZFC  + The  non- stationary  ideal  over  <0 \ is  -dense. 


Chart  of  Cardinals 


The  arrows  indicates  direct  implications  or  relative  consistency  implications,  often 
both. 
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(With  apologies  to  Burton  Dreben) 

This  appendix  addresses  several  larger  and  more  discursive  issues  that  may  be 
raised  by  set  theory  and  made  more  acute  by  the  theory  of  large  cardinals.  An 
attempt  is  made  to  deflate  their  significance,  and  thereby  to  forestall  moves  that 
end  up  in  metaphysics.  The  history  and  practice  of  mathematics  in  general  and 
set  theory  in  particular  affirms  that  they  have  achieved  an  evident  autonomy,  one 
that  should  resist  external  explanations,  extrapolations,  or  prescriptions. 

Although  there  have  been  strong  metaphysical  motivations  for  doing  mathe- 
matics, it  has  steadily  emerged  through  a thinning  process  to  achieve  a degree  of 
invariance  that  distinguishes  it  from  other  human  endeavors.  As  Laplace  replied 
to  Napoleon  when  asked  why  God  did  not  appear  in  his  work,  “I  have  no  need 
of  that  hypothesis!”  This  invariance  is  evident  in  the  universality  of  mathematical 
language  and  acceptance  of  a body  of  results  and  procedures,  and  is  arguably  part 
of  the  sense  of  being  mathematical.  Moreover,  it  is  this  invariance  that  has  led  to 
the  cumulative  progress  of  mathematics,  one  more  sustained  than  in  science  and 
very  different  from  the  dialectical  to  and  fro  of  philosophy. 

On  the  other  hand,  the  broader  history  of  mathematics  undercuts  any  sug- 
gestion of  orderly  progress  and  moreover  features  sudden  spurts,  the  major  ones 
brought  about  by  successively  new  contexts  - complexes  of  terms,  approaches, 
and  procedures.  These  contexts  provided  new  senses  that  transformed  or  refined 
older  concepts  and  resolved  previous  questions  thus  recast,  eventually  stabilizing 
at  new  levels  of  invariance.  To  frame  a discussion  of  set  theory,  we  shall  review 
(with  interpretive  twists)  the  salient  features  in  the  evolution  of  the  continuum  to 
the  real  numbers,  i.e.  the  so-called  “arithmetization  of  the  continuum”. 

The  most  momentous  mathematical  discovery  made  by  the  classical  Greeks 
was  that  the  side  and  the  diagonal  of  a square  are  incommensurable,  in  our  terms 
that  V2  is  irrational.  Filtering  through  the  mysteries  of  the  Pythagorean  school  in 
the  5th  Century  B.C.,  this  first  impossibility  result,  reputedly  due  to  one  Hippa- 
sus  of  Metapontum,  overturned  their  belief  that  geometry  can  be  investigated  in 
terms  of  ratios  of  (the  natural)  numbers.  With  the  situation  further  accentuated  by 
Zeno’s  paradoxes,  collectively  an  attack  on  the  intelligibility  of  infinitary  dynamic 
processes,  the  response  was  one  of  the  great  achievements  of  Greek  mathematics: 
Eudoxus’s  theory  of  geometrical  proportions  as  described  in  Book  V of  Euclid’s 
Elements.  This  context  provided  for  the  ordering  of  possibly  incommensurable 
magnitudes,  and  was  thus  the  first  mathematization  of  the  continuum.  But  ge- 
ometry was  dominant  to  the  extent  that  numbers  had  the  sense  of  geometrical 
magnitudes,  the  product  of  two  numbers  qua  lengths  being  an  area  and  so  forth, 
and  multiplication  in  practice  was  not  iterated  more  than  a few  times. 

Almost  two  millenia  later,  the  emergence  of  mercantile  arithmetic  and  quan- 
titative scientific  investigation  proceeded  hand  in  hand  with  the  introduction  of 
basic  arithmetical  notation.  By  the  early  17th  Century  +,  x,  =,  < were  in 
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common  usage,  and  also  V and  variables  x,  y.  This  amounted  to  a new  con- 
text in  which  long  calculations  became  surveyable,  particular  irrational  numbers 
gained  an  operative  legitimacy,  and  algebra  could  develop  systematically.  Even 
before  this  standardization,  the  study  of  equations  in  streamlined  notations  had 
suggested  new  combinations  like  V— T that  outstripped  sense  but  pointed  to  new 
possibilities. 

In  the  mid- 17th  Century  the  great  philosopher  Rene  Descartes  set  the  stage 
for  the  subsequent  expansionary  period  in  mathematics  with  his  advocacy  of  a 
new  framework,  analytic  geometry,  for  the  study  of  the  continuum.  Establishing 
the  primacy  of  algebra  over  geometry,  not  only  did  Descartes  make  the  conceptual 
move  to  the  familiar  coordinate  system,  but  he  loosened  the  connection  of  multi- 
plication to  dimension  so  that  polynomials  could  be  investigated  without  e.g.  x2 
being  regarded  as  an  area,  and  he  shifted  attention  from  closed  curves  to  those 
given  by  functional  variation. 

Surely  the  greatest  advance  in  mathematics  since  antiquity  was  the  indepen- 
dent creation  of  the  calculus  by  Newton  and  Leibniz  in  the  late  17th  Century. 
Newton  viewed  curves  as  representing  physical  motion  and  made  liberal  use  of 
infinite  series.  Leibniz  articulated  curves  with  infinitesimals  and  emphasized  the 
larger  possibilities  for  symbolic  manipulation.  Both  arrived  at  the  fundamental 
connection  between  tangents  to  curves  and  areas  underneath,  Newton  from  the 
former  and  Leibniz  from  the  latter.  But  even  in  this  very  multiplicity  genius  is 
contextual,  not  ineffable,  and  there  is  something  to  Newton’s  remark  that  he  saw 
farther  because  he  had  “stood  on  the  shoulders  of  giants”.  What  Newton  and  Leib- 
niz had  done  was  to  forge  a general  approach  that  subsumed  previous  piecemeal 
results,  an  approach  that  not  only  resolved,  in  the  new  terms,  a host  of  problems 
inherited  from  the  Greeks,  but  suggested  new  problems  and  possibilities  for  the 
emerging  field  of  mathematical  analysis. 

With  the  legacies  of  Newtonian  mechanics  and  Leibnizian  generative  sym- 
bolism, mathematics  expanded  tremendously  in  the  18th  Century,  especially  into 
the  new  domains  of  functions,  infinite  series,  and  differential  equations.  The  latter 
provided  a language  to  describe  physical  phenomena,  and  there  was  overwhelm- 
ing empirical  reinforcement,  particularly  in  celestial  mechanics.  The  century  was 
epitomized  by  Euler,  whose  staggering  output  featured  great  strides  in  inductive 
mathematics  bolstered  by  appeals  to  empirical  evidence,  as  in  physics  today,  and 
remarkable  computational  powers,  foreshadowing  recent  trends  in  mathematics. 

In  the  19th  Century  mathematics  not  only  continued  to  expand  at  a tremen- 
dous rate,  but  it  also  underwent  a transformation  based  on  new  structural  initia- 
tives, a transformation  beginning  in  analysis.  In  the  previous  century  the  vibrating 
string  had  been  much  discussed;  the  physics  suggested  the  superposition  of  many 
frequencies,  leading  symbolically  to  infinite  trigonometric  series,  as  well  as  the 
possibility  of  an  arbitrary  initial  configuration,  which  in  turn  led  to  the  extension 
of  the  concept  of  function  beyond  those  given  by  analytic  expressions.  The  ten- 
sion thus  created  by  this  juxtaposition  of  “infinite  series”  and  “function”  in  new 
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expanded  senses  was  to  lead  to  the  arithmetization  of  analysis  initiated  by  Cauchy 
and  eventually  to  the  creation  of  set  theory  by  Cantor. 

In  order  to  pursue  his  study  of  series  of  functions,  Cauchy  in  the  1820’s 
articulated  the  concept  of  limit,  and  in  its  terms,  convergence  of  series  and  con- 
tinuity of  functions.  This  amounted  to  a reorientation  of  mathematical  analysis 
in  that  divergent  series  were  excised  and  the  ground  laid  for  the  deniability  of  a 
property  of  functions  that  had  been  implicit  in  their  geometric  sense.  Indeed,  a 
discontinuous  counterexample  to  one  of  Cauchy’s  own  assertions,  that  the  sum  of 
a convergent  series  of  continuous  functions  is  continuous,  spurred  a more  careful 
analysis. 

Karl  Weierstrass  in  the  1850’s  introduced  the  familiar  Ve-3<5  style  of  formu- 
lating limits,  and  eliminated  once  and  for  all  the  justificatory  procedures  in  terms 
of  infinitesimals.  With  mathematicians  in  the  process  of  making  their  subject  free 
from  appeals  to  physical  or  geometric  intuition,  this  new  language  was  quickly  ac- 
cepted because  of  its  ability  to  draw  finer  distinctions.  By  this  means  Weierstrass 
was  able  to  rectify  Cauchy’s  assertion  by  incorporating  the  concept  of  uniform 
continuity,  and  moreover  to  formulate  a continuous  yet  nowhere  differentiable 
function,  driving  a wedge  between  continuity  and  differentiability. 

With  infinitesimals  replaced  by  the  concept  of  limit  and  that  cast  in  the  e-8 
language,  a level  of  deductive  rigor  was  incorporated  into  mathematics  that  had 
been  absent  for  two  millenia.  This  may  be  surprising,  but  the  concept  of  proof 
as  first  advanced  by  the  Greeks  had  not  remained  crucial  to  mathematics.  With 
the  creation  of  the  calculus,  the  available  calculational  procedures  and  the  steady 
reinforcement  of  empirical  evidence  had  been  sufficient  to  propel  mathematics 
inductively  forward.  But  with  the  function  concept  steadily  outgrowing  the  New- 
tonian basis  in  physical  motion,  a new  calculus  became  imperative.  Sense  for  the 
new  functions  given  in  terms  of  infinite  series  could  only  be  developed  through 
carefully  specified  deductive  procedures,  and  these  amounted  to  a new  calcula- 
tional technique.  Just  as  proof  for  the  Greeks  had  implicitly  been  the  vehicle 
for  demonstrating  geometric  constructibility,  proof  reemerged  as  an  extension  of 
algebraic  calculation  and  soon  became  intrinsic  as  the  basis  for  mathematics  in 
general. 

With  the  new  articulations  to  be  secured  by  proof  and  proof  in  turn  to  be 
based  on  prior  principles,  the  regress  lead  in  the  early  1870’s  to  the  appearance 
of  several  independent  formulations  of  the  real  numbers,  of  which  Cantor’s  and 
Richard  Dedekind’s  are  the  best  known.  It  is  at  first  quite  striking  that  the  real 
numbers  as  a class  came  to  be  developed  so  late,  but  this  can  be  viewed  against 
the  background  of  the  foregoing  account  as  part  of  a larger  conceptual  shift  from 
intensional  to  extensional  mathematics,  that  is  from  rules  to  objects: 

The  geometric  investigation  of  the  continuum  as  transformed  by  the  calculus 
had  made  functional  variation  central  to  mathematics.  But  functions  were  ini- 
tially identified  with  analytic  expressions,  so  that  they  were  viewed  intensionally 
as  rules,  whether  generating  geometric  figures  or  representing  physical  motion. 
Although  infinitesimal  change  served  to  motivate  differentiation  and  integration 
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as  transformations  of  these  rules,  there  was  no  mathematical  need  to  analyze  the 
continuum  itself.  Mathematics  has  consistently  maintained  invariance  with  such 
minimal  commitments,  but  the  1 9th  Century  articulation  of  limits  and  continuity 
to  be  demonstrated  by  proof  brought  to  bear  new  pressures  toward  an  extensional 
view  of  functions  as  acting  on  points.  With  geometric  assumptions  made  more 
explicit  and  infinite  series  outstripping  sense,  it  became  necessary  to  adopt  an 
arithmetical  view  of  the  continuum  given  extensionally  as  a collection  of  points. 

Cantor’s  formulation  of  the  real  numbers  appeared  in  his  seminal  paper  [72] 
on  Fourier  series;  proceeding  in  terms  of  fundamental  sequences,  he  laid  the  basis 
for  his  theorems  on  sequential  convergence.  Dedekind  [72]  formulated  the  real 
numbers  in  terms  of  his  cuts  to  express  the  completeness  of  the  continuum;  deriv- 
ing the  least  upper  bound  principle  as  a simple  consequence,  he  thereby  secured 
the  basic  properties  of  continuous  functions.  In  the  use  of  arbitrary  sequences 
and  infinite  collections,  both  Cantor’s  and  Dedekind’s  objectifications  of  the  con- 
tinuum helped  set  the  stage  for  the  subsequent  development  of  that  extensional 
mathematics  par  excellence,  set  theory.  Cantor  was  led  to  his  formulation  rather 
pragmatically  to  secure  specific  results,  but  they  were  also  the  results  that  sug- 
gested the  enumerations  leading  to  the  investigation  of  the  transfinite.  Dedekind 
[72]  describes  how  he  came  to  his  formulation  much  earlier,  but  also  acknowledges 
Cantor’s  work. 

Neither  Cantor  nor  Dedekind  regarded  their  respective  formulations  and  its 
correlation  with  an  antecedent  continuum  as  automatic.  Cantor  [72:  97]  wrote: 

In  order  to  complete  the  connection  . . . with  the  geometry  of  the  straight  line,  one 
must  only  add  an  axiom  which  simply  says  that  conversely  every  numerical  quantity  also 
has  a determined  point  on  the  straight  line,  whose  coordinate  is  equal  to  that  quantity  ...  I 
call  this  proposition  an  axiom  because  by  its  nature  it  cannot  be  universally  proved.  A 
certain  objectivity  is  then  subsequently  gained  thereby  for  the  quantities  although  they  are 
quite  independent  of  this. 


Dedekind  [72]  (see  [63: 1 Iff])  wrote: 


If  all  points  on  the  straight  line  fall  into  two  classes  such  that  every  point  of  the  first  class 
lies  to  the  left  of  every  point  of  the  second  class,  then  there  exists  one  and  only  one  point 
that  produces  this  division  . . . 

...  I am  glad  if  everyone  finds  the  above  principle  so  obvious  and  so  in  harmony 
with  his  own  ideas  of  a line;  for  I am  utterly  unable  to  adduce  any  proof  of  its  correctness, 
nor  has  anyone  the  power.  The  assumption  of  this  property  of  the  line  is  nothing  else  than 
an  axiom  by  which  we  attribute  to  the  line  its  continuity,  by  which  we  find  continuity  in 
the  line. 


Dedekind  was  not  above  “proving”  basic  principles  (cf.  his  (in)famous  proof  of 
Theorem  66  in  his  [88],  that  there  is  an  infinite  set),  but  here  he  advocates  an 
axiomatic  correlation  as  does  Cantor.  Dedekind’s  circumlocutions  in  terms  of 
“creation”  via  cuts  led  to  Russell’s  [03:279ff]  criticism.  However,  Russell’s  ex- 
tensional solution  of  simply  defining  the  real  numbers  as  the  cuts,  while  consistent 
with  his  reductionism,  obscures  an  antecedent  sense  of  the  continuum  that  both 
Cantor  and  Dedekind  were  trying  to  accommodate.  Neither  theft  nor  honest  toil 
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sufficed;  both  Cantor  and  Dedekind  recognized  the  need  for  a sort  of  Church’s 
Thesis,  a thesis  of  adequacy  for  the  new  construal  of  the  continuum  as  a collection 
of  points. 

Set  theory  emerged  out  of  this  setting,  with  the  larger  backdrop  of  mathemat- 
ics also  featuring  new  extensional  initiatives  in  the  development  of  algebraic  and 
geometric  structures.  The  birth  of  set  theory  was  attended  by  the  metaphysics  of 
Cantor’s  Absolute;  it  was  raised  on  the  more  subtle  metaphysical  initiatives  from 
logic;  and  throughout,  the  mathematization  of  the  infinite  confronted  concerns 
about  its  very  possibility. 

The  numerical  infinite  of  indefinite  progression  had  long  been  held  to  be  inco- 
herent as  a completed  totality.  With  Aristotle’s  potential  infinite  vs.  actual  infinite 
a traditional  demarcation,  the  occasional  excursions  into  the  latter  had  only  been 
morality  tales  clinched  by  apparently  paradoxical  one-to-one  correspondences.  In 
particular,  the  only  possibility  for  an  answer  to  the  question  “how  many  points 
are  there  on  a line?”  had  been  “potentially  infinite”,  with  “infinite”  in  the  decid- 
edly negative,  etymological  sense  of  “not  finite”.  It  was  Cantor’s  incisive  work 
that  made  of  the  infinite  a positive  concept  and  provided  a structured  sense  to 
the  question  with  the  answer,  2K°.  Cantor  charted  out  the  transfinite  with  simple 
generative  and  arithmetical  rules,  and  thereby  provided  a mathematical  context 
for  the  investigation  of  infinite  collections.  The  infinite,  thus  cast,  was  after  all 
mathematically  coherent. 

As  before  in  mathematics  the  new  language  suggested  a host  of  new  possi- 
bilities, most  notably  2K°  = Kj,  but  the  whole  transfinite  landscape  was  slow  in 
gaining  acceptance.  Philosophical  skepticism  about  the  actual  infinite  may  have 
played  an  inhibitory  role,  just  as  Kant’s  dogma  of  the  a priori  of  Euclidean  ge- 
ometry may  have  done  for  non-Euclidean  geometry,  but  the  main  factor  may  have 
simply  been  the  mathematical  reluctance  to  contemplate  a proliferation  of  new 
entities.  Whereas  the  natural  numbers  were  as  old  as  time  and  at  the  heart  of  the 
intuitive  underpinnings  of  mathematics,  the  transfinite  numbers  seemed  at  first  to 
be  contrived  and  of  little  mathematical  use.  While  Cauchy’s  formulation  of  conti- 
nuity had  been  quickly  accepted  as  the  articulation  of  geometric  assumptions,  there 
was  no  comparable  backing  to  the  transfinite  numbers.  That  reinforcement  was  to 
be  secured  by  the  steadily  increasing  use  of  the  transfinite  leading  eventually  to 
the  full-blown  theory  of  large  cardinals,  and  the  explicit  extensional  casting  of  set 
theory  through  axiomatization. 

As  described  in  the  introduction,  Zermelo  axiomatized  set  theory  in  order  to 
make  explicit  some  underlying  set  existence  assumptions  and  thereby  advanced 
a combinatorial  view  of  sets  structured  solely  by  e and  simple  operations.  The 
vagueness  of  the  definit  property  in  the  Separation  Axiom  invited  Skolem’s  pro- 
posal to  base  it  on  first-order  logic,  and  the  addition  of  the  Replacement  Axiom 
figured  in  von  Neumann’s  reformulation  of  the  transfinite  numbers  as  the  ordinals, 
transitive  sets  well-ordered  by  e. 

Beginning  as  a mathematical  theory  of  extensions,  axiomatic  set  theory  has 
carried  the  weight  of  a larger  significance  having  to  do  generally  with  the  existence 
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of  mathematical  objects.  However,  just  as  Euclid’s  axioms  for  geometry  had  set 
out  the  permissible  geometric  constructions,  the  axioms  of  set  theory  set  out  the 
specific  conditions  for  set  generation,  and  this  in  a new  calculus  based  on  first- 
order  logic.  Unlike  the  emergence  of  mathematics  from  marketplace  arithmetic 
and  Greek  geometry,  sets  and  transfinite  numbers  were  neither  laden  nor  buttressed 
with  substantial  antecedence.  Like  strangers  in  a strange  land,  stalwarts  developed 
a familiarity  with  them  guided  hand  in  hand  by  their  axiomatic  scaffolding,  which 
served  as  new  rules  of  procedure.  In  particular,  existential  propositions  of  set 
theory  are  closely  tied,  even  in  practice,  to  the  3 of  first-order  logic  whose  sense 
is  determined  by  its  logical  rules,  and  do  not  call  for  correlation  with  “existence” 
in  some  larger  sense. 

As  for  the  contextualized  existence  of  infinite  sets,  the  Axiom  of  Infinity  in 
its  usual  formulation  is  just  the  extensional  counterpart  to  the  principle  of  math- 
ematical induction  as  a rule  for  deriving  the  universal  'imp (in).  In  so  far  as  the 
natural  numbers  do  have  an  antecedent  sense,  itup(n)  should  be  correlated  with 
the  informal  counterparts  to  </>(0),  tp(  1),  (p(2). . . ..  However,  the  correlation  here 
is  less  direct  than  that  for  the  continuum  with  formulations  of  the  real  numbers, 
since  number-theoretic  assertions  are  being  made.  Contra  Poincare,  Hilbert  distin- 
guished between  contentual  (inhaltlich)  induction  proceeding  recursively  from  one 
number  to  the  next  and  formal  induction  by  which  inq>(ri)  follows  immediately 
from  the  (p(0)  A in(cp{n)  -a  q>(n  + 1))  and  “through  which  alone  the  mathematical 
variable  can  begin  to  play  its  role  in  the  formal  system”  (Hilbert  [28];  see  van 
Heijenoort  [67: 473]).  With  Cantorian  metaphysics  thinned  out  by  axiomatization, 
there  is  no  larger  sense  of  existence  beyond  the  use  of  formal  induction  in  which 
the  infinite  has  been  domesticated;  a telling  observation  is  that  if  there  is  some 
doubt  about  some  impin')  purportedly  established  by  the  rule,  no  search  is  under- 
taken for  a particular  natural  number  a so  that  <p[«]  fails,  but  rather  the  putative 
proof  is  carefully  scrutinized  much  as  an  arithmetical  calculation  is  checked.  There 
is  no  larger  sense  to  the  Axiom  of  Infinity  other  than  providing  the  extension  of 
formal  induction;  one  consequence  is  that  the  Cantorian  move  against  the  natural 
numbers  as  having  no  end  in  the  traditional  “after”  sense  is  neatly  rendered  by 
extensionalizing  induction  itself  with  the  ordinal  &>,  with  “after”  recast  as  “e”. 

The  transfinite  is  similarly  contextualized  by  Replacement  and  the  principle 
of  transfinite  induction.  Recalling  Cantor’s  unitary  view  of  the  finite  and  the 
transfinite,  the  principle  is  a simple  extension  of  induction  through  limit  points, 
and  the  seeming  exacerbation  of  the  breach  into  the  actual  infinite  amounts  to 
just  contextual  deductions  from  this  new  rule.  From  this  viewpoint,  the  Axiom 
of  Choice  can  be  regarded  as  a similarly  necessary  principle  for  infusing  the 
contextualized  transfinite  with  the  order  already  inherent  in  the  finite. 

As  described  in  the  introduction,  the  Foundation  Axiom  and  the  iterative 
conception  of  set  converted  set  theory  into  a study  of  well-foundedness.  After 
the  infusion  of  model-theoretic  techniques,  the  development  of  forcing  and  inner 
model  theory  established  set  theory  as  a sophisticated  and  distinctive  field  of 
mathematics.  And  to  repeat,  formalized  versions  of  truth  and  consistency  became 
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matters  for  combinatorial  manipulation  as  in  algebra,  with  large  cardinals  providing 
an  elegant  and  fully  sufficient  superstructure  for  the  study  of  consistency  strength. 

Large  cardinal  hypotheses  as  existential  propositions  have  a distinctive  status 
in  so  far  as  relative  consistency  results  have  made  of  them  a gauge,  the  interplay 
between  forcing  and  inner  models  becoming  part  of  their  collective  sense  in  set 
theory.  Just  as  large  finite  numbers  became  surveyable  through  arithmetical  nota- 
tion, so  also  did  large  cardinals  through  their  set-theoretic  formulations  as  part  of 
the  calculus  of  Zermelian  set  theory.  And  just  as  large  finite  numbers  seem  hope- 
lessly inaccessible  in  terms  of  counting  one  by  one,  so  also  do  large  cardinals  in 
terms  of  hierarchically  simpler  processes.  But  just  as  we  can  work  with  lO1,000,000 
in  the  proper  context,  so  we  can  work  with  a measurable  cardinal. 

What  about  the  role  of  set  theory  as  a foundation  for  mathematics?  As  a 
mathematical  theory  of  extension,  a small  part  of  Zermelian  set  theory  can  serve 
as  an  ambient  framework  for  most  of  ongoing  mathematics.  Recapitulating  the 
history  above,  rules  can  be  extensionalized  and  then  further  recast  in  terms  of 
sets,  e,  and  =.  Be  this  as  it  may,  this  reduction  does  not  necessarily  clarify,  for 
generally  speaking,  mathematics  operates  at  various  levels  of  organization  and 
articulation:  Number  theory  thrives  first  at  the  level  of  arithmetical  rules  and 
then  at  the  level  of  analytic  superstructures,  far  above  Frege  and  even  Peano. 
Mathematical  analysis  thrives  first  at  the  level  of  Weierstrass  and  then  at  the  level 
of  functional  superstructures,  with  only  a faint  nod  to  Cantor  and  Dedekind.  Like 
the  relation  of  organic  chemistry  to  particle  physics,  reducibility  is  acknowledged, 
but  the  emphasis  is  rather  on  the  possibilities  afforded  by  specific  conceptual 
schemes  at  different  levels  of  organization. 

On  the  other  hand,  as  a study  of  well-foundedness  ZFC  together  with  the 
spectrum  of  large  cardinals  serves  as  a court  of  adjudication,  in  terms  of  relative 
consistency,  for  mathematical  propositions  that  can  be  informatively  contextual- 
ized in  set  theory  by  letting  their  variables  range  over  the  set  theoretic  universe. 
Thus,  set  theory  is  more  of  an  open-ended  framework  for  mathematics  rather  than 
an  elucidating  foundation,  ft  is  as  a field  of  mathematics  that  both  proceeds  with 
its  own  internal  questions  and  is  capable  of  contextualizing  over  a broad  range 
which  makes  of  set  theory  an  intriguing  and  highly  distinctive  subject. 

What  then  is  left  for  philosophy?  Existence,  truth,  and  knowledge,  when 
taken  in  the  large,  are  ultimately  contentious  subjects  for  debate  rather  than  con- 
cepts for  explication.  The  subjects  of  much  philosophizing,  these  as  well  as  the 
whole  package  of  dichotomies  like  objective/subjective,  realist/anti-realist,  and 
contingent/necessary  are  of  unlimited  fluidity  and  variability.  For  example,  a ta- 
ble, the  moon,  and  unicorns  exist  in  such  plainly  different  ways  that  existence  per 
se  cannot  have  sense  as  a prior  category.  It  is  only  through  communication  and 
learning  that  we  become  familiar  with  the  various  uses  of  the  term  as  part  of  our 
language.  This  being  the  case,  such  general  terms  cannot  serve,  unreflectively  or 
with  short  “definitions”,  as  the  beginning  of  some  analysis,  but  must  themselves 
be  subject  to  contextualized  description.  Mathematics  has  long  been  held  up  as  a 
paragon  of  clarity  and  knowledge,  but  even  then,  assertions  like  “mathematics  is 
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true”  or  “numbers  exist”  are  without  antecedent  sense  and  must  be  developed  and 
argued  for  in  toto,  each  like  a new  sentence  formed  from  familiar  words. 

But  where  to  begin,  and  how  far  to  go?  Considerations  of  assertions  like 
“a  sea-battle  will  take  place  in  the  Aegean  tomorrow”,  “the  evening  star  is  the 
morning  star”,  or  even  “7  + 5 = 12”  should  begin  with  their  evidently  invariant 
feature,  the  words  themselves.  And  such  considerations  can  arguably  proceed  at 
most  to  a description  of  their  interplay  as  part  of  how  we  use  language,  if  they  are 
not  to  overleap  the  bounds  of  invariance  and  become  enmeshed  in  metaphysics. 
According  to  Wittgenstein  in  Philosophical  Investigations  (§109): 


. . . We  must  do  away  with  all  explanation,  and  description  alone  must  take  its  place.  And 
this  description  gets  its  light,  that  is  to  say  its  purpose,  from  the  philosophical  problems. 
These  are,  of  course,  not  empirical  problems;  they  are  solved,  rather,  by  looking  into  the 
workings  of  our  language,  and  that  in  such  a way  as  to  make  us  recognize  those  workings; 
in  despite  of  an  urge  to  misunderstand  them.  The  problems  are  solved,  not  by  giving 
new  information,  but  by  arranging  what  we  have  always  known.  Philosophy  is  a battle 
against  the  bewitchment  of  our  intelligence  by  means  of  language. 


A historical  account  was  given  above  “arranging  what  we  have  always 
known”  to  describe  existence  as  contextualized  in  set  theory.  As  for  truth,  the 
attitude  is  similar.  To  be  bold,  mathematical  truth  is  what  we  have  come  to  make 
of  it.  As  for  knowledge,  description  ultimately  provides  no  insights  beyond  de- 
scription. To  be  bolder  still,  it  may  be  that  we  cannot  say  anything  other  than 
that  the  acquisition  of  mathematical  knowledge  may  be  just  what  happens.  Like 
morning  lilies  opening  at  dawn  or  squirrels  saving  up  nuts  in  trees,  we  just  do  it. 

To  pursue  an  analogy,  the  world  of  mathematics  is  like  a great  cathedral.  The 
thick  stone  walls  along  the  stately  aisles  still  show  the  lines  of  the  ancient  church 
that  predated  the  grand  edifice.  The  central  dome  is  supported  by  high  arches  of 
vaulting  stone,  resilient  reminders  of  the  anonymous  master  masons.  Whatever 
their  design,  the  arches  have  easily  supported  the  elegant  latter-day  spires  reaching 
high  into  the  sky.  The  first  adornments  can  still  be  seen  in  the  oldest  chapels; 
there  in  continuing  communion  with  the  past  steady  additions  are  made,  each 
new  age  imparting  its  own  distinctive  style.  In  recent  memory  large  new  side 
chapels  have  been  constructed,  and  new  flying  buttresses  for  extra  support.  Every 
day  the  curious  enter  through  the  great  door  of  polished  wood  with  the  attractive 
inset  figures.  Several  venture  down  the  long  nave  seeking  instruction,  and  a few 
even  initiation,  quite  taken  by  the  the  order  and  beauty  of  the  altar.  And  the 
work  continues:  The  architects  attempt  to  chart  out  large  parts  of  the  cathedral, 
some  even  proposing  vast  renovations.  The  craftsman  continue  the  steady  work 
on  the  new  wood  paneling,  the  restoration  of  the  sculpture,  and  the  mortaring  of 
the  cracks  that  appear  with  age.  And  supported  by  high  scaffolding,  the  artisans 
continue  to  work  on  the  fine  stained  glass.  They  try  to  coordinate  with  their 
colleagues  in  the  adjoining  frames,  but  sometimes  the  heady  heights  inspire  them 
to  produce  new  gems.  Those  who  step  back  see  a larger  scheme,  but  they  cannot 
see  across  the  whole  breadth.  And  they  are  so  high  up  that  they  can  no  longer 
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see  their  supports.  Nevertheless,  they  are  sustained  as  a community,  as  part  of  the 
ongoing  human  adventure. 

To  append  an  apocryphal  tale:  A host  of  industrious  spiders  started  to  build 
an  elaborate  network  of  webs  in  newly  excavated  vaults  beneath  the  cathedral. 
It  quickly  grew  so  thick  and  complex  that  no  one  could  venture  across  without 
getting  enmeshed.  One  day,  a fearful  wind  came  howling  in  and  blew  a gaping 
hole  through  the  network,  and  in  desperate  response  the  spiders  worked  frantically 
to  reestablish  the  connections.  For  you  see,  the  spiders  had  become  convinced  that 
their  carefully  constructed  webbing  was  the  foundation  without  which  the  entire 
cathedral  would  totter.  Of  course,  the  craftsmen  above  hardly  raised  an  eyebrow. 

There  are  some  remaining  possibilities  for  metaphysical  appropriation  that 
should  be  forestalled.  Mathematicians  themselves  have  often  described  a feeling 
of  dealing  with  autonomous  objects,  some  professing  an  avowedly  realist  view 
of  mathematics.  The  reply  is  that  this  objectification  is  part  of  the  practice  of 
mathematics,  the  sense  of  existence  here  to  be  described  as  in  any  other  concerted 
human  activity.  Among  many  examples  of  the  sort,  one  should  remember  how 
roundly  Edward  Gibbon  was  criticized  by  the  faithful  for  having  presented  the 
emergence  of  Christianity  as  part  of  history.  And  there  is  the  analogy  with  the 
blindfold  chess  player  who  can  play  out  entire  games;  he  may  visualize  the  pieces 
on  a particular  chessboard,  but  in  the  end  what  remains  is  the  structure  of  the  game 
as  communicated  by  him  through  the  notation.  Finally,  Godel’s  realist  arguments 
in  [47]  have  been  much  discussed,  no  doubt  in  part  because  of  the  significance  of 
his  mathematical  results.  But  again,  it  is  the  invariance  of  those  results  that  lies  at 
the  heart  of  the  matter.  Godel  himself  much  admired  the  work  of  Robinson  and 
wanted  him  as  his  (Godel’s)  successor  at  the  Institute  for  Advanced  Study;  that 
Robinson  [65]  was  a committed  formalist  was  never  of  mathematical  consequence. 

But  even  with  the  metaphysics  thinned  out,  there  is  still  the  recurrent  feeling, 
familiar  to  the  working  mathematician,  of  questions  being  induced  by  a context, 
and  once  resolved,  a gripping  sense  of  inevitability  about  their  solutions.  This 
suggests  the  possibility  of  a new  metaphysical  appropriation,  as  the  mathemati- 
cian seems  to  be  impelled  to  extend  the  boundaries  of  order  against  a chaos  of 
possibilities.  But  this  feeling  is  also  familiar  to  the  artist,  who  can  proceed  straight- 
forwardly at  various  junctures  once  the  context  has  largely  precipitated;  as  James 
Joyce  in  Ulysses  (17: 1012-15)  describes  Dedalus, 

He  affirmed  his  significance  as  a conscious  animal  proceeding  syllogistically  from  the 
known  to  the  unknown  and  a conscious  rational  reagent  between  a micro  and  a macrocosm 
ineluctably  constructed  upon  the  incertitude  of  the  void. 


Even  with  this  said,  there  may  still  be  lurking  some  inchoate  feeling  that  the 
fact  of  mathematics  still  calls  for  some  kind  of  explanation.  Beyond  any  concerns 
about  its  unreasonable  effectiveness  in  the  natural  sciences,  there  may  remain  a 
larger  feeling  of  mystery  about  how  the  world  of  mathematics  has  come  about 
and  fits  together  into  a coherent  whole.  But  this  final  possibility  for  metaphysical 
appropriation,  along  with  the  more  traditional  musings  about  the  starry  heaven 
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above  or  the  moral  law  within,  are  not  in  the  world  but  of  the  mystical,  part  of 
the  feeling  for  the  unity  of  experience  in  the  large.  According  to  Wittgenstein  in 
the  Tractatus  (6.41): 

The  sense  of  the  world  must  lie  outside  the  world.  In  the  world  everything  is  as  it 
is  and  happens  as  it  does  happen.  In  it  there  is  no  value  - and  if  there  were,  it  would  be 
of  no  value. 

If  there  is  a value  which  is  of  value,  it  must  lie  outside  all  happening  and  being-so. 

For  all  happening  and  being-so  is  accidental. 

What  makes  it  non-accidental  cannot  lie  in  in  the  world,  for  otherwise  this  would 
again  be  accidental. 

It  must  lie  outside  the  world. 


Again  from  the  Tractatus  (6.44): 

Not  how  the  world  is,  is  the  mystical,  but  that  it  is. 

And  this  in  Wittgenstein’s  dialectical  distinction  can  at  most  be  shown,  not  said, 
leading  in  one  direction  to  his  admonition  to  silence  at  the  end  of  the  Tractatus. 
Neither  metaphysics  nor  solipsism,  it  is  that  part  of  human  experience  beyond 
human  discourse.  The  Tao  Te  Ching  of  Lao-Tzu  begins: 

The  Tao  that  can  be  told  is  not  the  eternal  Tao. 

The  name  that  can  be  named  is  not  the  eternal  name. 
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Absolute,  Cantor’s  XII,  XV,  XVII,  31, 42, 313, 
477 

Absolute  relation  174 
AC  ( see  Axiom  of  Choice) 

ACft,;  A Ctoi*0 co)  ( see  Countable  Axiom  of 
Choice) 

Ackermann’  s set  theory  (A)  311-313,318 

AD  ( see  Axiom  of  Determinacy) 

ADr  329, 377, 399-402, 441, 458, 467^171 
AD^2  469 
Algebra  93 

Almost  huge  cardinal  331, 333, 334, 338 

a -branch  (of  a tree)  72 

a -inaccessible  cardinal  27,  33 

a-Mahlo  cardinal  27,  33, 41,  58 

ot-Il\{a)  set  443- 449 

ath  iterate  (of  a structure)  250 

oi th  level  (of  a tree)  72 

a -weakly  inaccessible  cardinal  16, 17 

a - weakly  Mahlo  cardinal  7 7, 2 1 6 , 2 1 7 

Amenable  pair  187,  244 

Amenable-generic  set  187 

Analytic  set  147,  148 

Analytical  hierarchy  155 

Analytical  relation  1 5 1 , 755 

- in  a 156 
Antichain 

- of  a tree  72 

- of  conditions  118 

- of  conditions  below  p 118 

Application  ( ap ) 752 

A-set  147,  148 
a t (see  also  Daggers)  294 
a#  (see  also  Sharps)  110 
Arithmetical  hierarchy  154 
Arithmetical  relation  151,754 

- in  a 156 

Aronszajn  tree  (see  /r-Aronszajn  tree) 

Atom  (for  a measure)  24 
Atomless  measure  24 

Axiom  of  Choice  (AC)  XIII,  XV,  1 , 2 1 , 22, 28, 
32, 34, 71, 101,  1 14,  132,  167, 257, 324, 329, 
372, 377, 478 

Axiom  of  Constructibility  (V  = L)  (see  also  L ) 
28, 44,66 

Axiom  of  Determinacy  (AD)  31,  167,  192, 
204, 377-379, 382,  §28, 414, 415, 425^136, 
449, §32 

Axiom  of  Infinity  XIV,  58, 478 
Axiom  of  Reducibility  XIII,  29 


Baire  Category  Theorem  13 
Baire  function  145 
Baire  property  13,  14, 136 

- for  all  sets  from  inaccessibility  132, 136 

- for  all  sets  under  AD  377, 468 

- for  sets  181 

- for  provably  A^  sets  1 80 

- for  E\  (analytic)  sets  148 

- for  sets  179 

- for  E*+l  sets  under  Det(77^)  380 

- for  weakly  homogeneously  Suslin  sets  457 

- in  L 169 

- in  L[U]  273 

- via  games  374 
Baire  space  77 
Banach  algebra  450 
Begriffsschrift  XIII 

Berkeley  (see  University  of  California  at 
Berkeley) 

Boolean  algebra  26, 116, 120, 129, 213, 214, 
223 

Boolean- valued  model  1 14,  1 16, 1 17, 223 

Borel  Determinacy  372, 373, 379, 440-442 
Borel  hierarchy  146,  149,  151,  152 
Borel  measure  (mg)  12 
Borel  set  (see  also  y -Borel  set,  oo-Borel  set, 
Borel  Determinacy)  12, 145-149,  165 
Borel  Wadge  Determinacy  415, 416, 442 
Bounded  quantifier  5, 153 
Branch  (of  a tree)  72 
Burali-Forti  Paradox  XIV 

Cabal  Seminar  368,403,450 
CH  (see  Continuum  Hypothesis) 

Chain  (of  a tree)  72 
Chang’s  Conjecture  85,  87,  88, 282, 292 
Closed  code  137 
Closed  set 

-in  VKy  340, 341 

- of  "X  377,372 

Closed  unbounded  filter 

- over  X (C\)  3 

- o\qtVky  (CK,Y)  347,342 

Closed  unbounded  set  (see  also  v-closed 
unbounded  set) 

- in  X 2,3 

-in  7 \y  340 

Coding  Lemma  397, 398, 425, 428 
Cofinal  branch  (of  a tree)  72 
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Cohen  real  727, 130, 177, 179-181 
Coinductive  set  430 
Collapsing  Lemma  7, 48,  62 
Compactness  Theorem  XV,  36, 39 
Comparable  structures  265 
Comparison  Lemma  (for  ZFC-  premice)  266 
Compatible  conditions  ( p ||  q ) 118 

Complete  Boolean  algebra  116,  120,273,214, 
223 

Complete  embedding  119, 120 
Complete  set  (of  Skolem  functions)  8 
Completeness  Theorem  XV,  102, 185 
Condition  117 
Constructibility  degree  186 
Constructive  closure  ( see  L(A )) 

Constructive  hierarchy  (see  L) 

Constructive  set  (see  also  L)  28, 29 
Continuously  closed  class  474,415 
Continuum  Hypothesis  (CH)  XII,  1,28-32, 
114-116, 133, 134,378 

Countable  Axiom  of  Choice  (AC<y)  132, 433 

- restricted  to  sets  of  reals  (A Qy  ("<*>))  152, 

159, 162,378,403,417 

Countably  complete  tower  451 
Countably  complete  ultrafilter  254, 256 
Countably  iterable  structure  254, 255 
Covering  Theorem  111,  443 
Critical  point  (crit(y'))  47 
Cumulative  hierarchy  XIII,  XIV,  XVI,  XVII,  1 , 
20 

Daggers  294, 444, 445, 468 

DC  (see  Principle  of  Dependent  Choices) 

DC("<u)  434 

definit  property  XIII,  19 

A|  sets 

- Baire  property  169,  181 

- Determinacy  467 

- Lebesgue  measurability  150, 169, 181 
A -system  121 

A-system  Lemma  121 

Dense  embedding  119, 120 

Dense  open  set  (of  conditions)  118 

Dense  set  (of  conditions)  118 

Derived  set  (A')  133 

Det(A)  379 

Determinacy 

- of  Borel  sets  (see  Borel  Determinacy) 

- of  77/  sets  437^40 

- of  77]  sets  467 

- of  77]  sets  329,449,467 

- of  projective  sets  (see  Projective  Determinacy) 

Determined  game  370 

Determined  set  370 
Diagonal  intersection 

- of  sets  indexed  by  A.  2 

- of  sets  indexed  by  [k]<(°  237,  242 

- of  subsets  ofV(S)  301 


Difference  hierarchy  (Diff(A))  (see  also  a-Il\  (a) 
set)  442 
Direct  limit  9,10 
Directed  set  9 
Directed  system  9, 10 
Dual  filter  (of  an  ideal)  210 

Easton  product  122, 228, 230, 232 
Elementary  embedding  (see  also  Xn  -elementary 
embedding)  44, 46 

Elementary  substructure  (see  also  Xn  -elementary 
substructure)  39, 44 
EM  blueprint  707-107 

- for  a 110 

EM2  blueprint  284- 288 

- for  a 294 
e -model  4 

Equiconsistency  result  XIX,  4, 135 

Equivalent  ultrafilters  269 

Erdos  cardinal  (k (a))  80- 84 

Erdos-Rado  Theorem  74 

77-extendible  cardinal  377-318 

77 -extendible  for  A cardinal  339 

77 -supercompact  for  A cardinal  339 

Extender  353-355,  §26 

Extendible  cardinal  377-312, 315-318, 322 

Extension  property  39,  76 

Extensional  relation  7 

Filter  3 

Fine  filter  301 

Fine  structure  theory  30, 43 1 

First  Periodicity  Theorem  410 

Foundation  Axiom  XIV,  20, 478 

Friedman  game  397 

Fa  set  72 

F-stationary  set  53 

Function  quantifier  153 

Fundamenta  Mathematicae  22 

Game  369, 370 
Game  quantifier  427 

F-good  well-ordering  169, 273, 409, 422, 463 
F-norm  406, 407 
F-scale  418 
y -Borel  set  424-427, 469 
y -compact  cardinal  307 
y -strong  cardinal  358,  359 
y -strong  for  A cardinal  363 
y -supercompact  cardinal  (see  also  77- 
supercompact  for  A cardinal)  298, 
§22,316,317, 348,349,358 
GCH  (see  Generalized  Continuum  Hypothesis) 
Gs  code  737 
Gs  set  72 

Generalization  (argument)  XVII,  32 
Generalized  Continuum  Hypothesis  (GCH)  1, 
16,28,263 

Generic  elementary  embedding  459, 460 
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Generic  extension  118 

Generic  ultrapower  210 

Genericity  Conjecture  186 

Global  Axiom  of  Choice  59 

Good  well-ordering  ( see  r~ good  well-ordering) 

Ground  model  118 

Harvard  University  115 
Hebrew  University  309 
Height  (of  a tree)  72 
Hereditarily  "On-definable  set  1 41 
Hereditarily  ordinal  definable  set  466 
HOD  466 

Homogeneous  Boolean  algebra  129 
Homogeneous  set  (for  a function) 

- on  an  ordinal  71 
~ on  [VKy]nc  346 
Homogeneous  tree  1 95, 45 1 , 452 
Homogeneously  Suslin  set  452 
Honest  leftmost  branch  176, 204, 418 
Huge  cardinal  233, 331, 334, 358, 449 
Hyperarithmetic  hierarchy  157 
Hyperarithmetic  set  157, 166 
Hyper Woodin  cardinal  365 

10  325,329,449 

11  325- 329, 468 

12  325- 329, 449 

13  325- 329 
Ideal  3 

Ideal  over  k 211 

Inaccessible  cardinal  ( see  also  a -inaccessible 
cardinal,  a -weakly  inaccessible  cardinal, 
Weakly  inaccessible  cardinal)  18, 26, 38, 
57, 83, 115,  125, 127, 132, 136,  144 

- co i inaccessible  from  reals  135,319 
Incompatible  conditions  (p  _L  q)  118 
Incompleteness  Theorem  XV,  28, 70,  151 

- Second  19, 399 
Indescribable  cardinal  58,  §6 
Indescribable  filter  65, 66 
Indescribable  in  k set  64 
Indiscemibles,  set  of  99, 253 

- double  284 

Inductive  definability  430 
Inductive  set  (of  reals)  430 
oo-Borel  set  469 

Inner  model  33 

- of  measurability  §20 

Institute  of  Advanced  Study  28, 30, 114 
Isolated  point  (of  a set  of  reals)  133 
Iterable  M-ultrafilter  250 
Iterable  structure  250, 255-257 
Iterate  (of  a structure)  250 
Iterated  forcing  121, 123, 124 
Iterated  ultrapower  §19 
Iteration  (of  a structure)  250 
Iteration  tree  463 


Iterative  conception  (of  set)  ( see  also  Cumulative 
hierarchy)  XV 

Jonsson  algebra  93 

Jonsson  cardinal  93- 98,  128,275,282,292, 

396 

/c-Aronszajn  tree  78- 80, 384 

/c-Baire  partial  order  121 
k -chain  condition  (for  a partial  order)  121 
k -closed  partial  order  121 
k -homogeneous  tree  451,452 
k -homogeneously  Suslin  set  452 
k -model  (see  also  L[U])  261 

k -product  (of  partial  orders)  122 
/r-tree  75 

k -weakly  homogeneous  tree  453-4 59, 463 
k -weakly  homogeneously  Suslin  set  453-459, 
464 

Kleene-Brouwer  ordering  (<kb)  762, 193, 194 

- <kb2  version  194 
Kunen-Paris  forcing  226,  227 

L 28,  §3, 44, 49, 62, 66, 88,  89, 93,  §9,  150, 

151, 167-170, 277-283 
L 199 

L[A]  (see  also  L[U])  34 

L(A ) (see  also  L( IR))  34,  51, 359 

L( R)  142, 378, 381, 398, 43 1^134, 461, 464, 
466, 467 

L[U]  261,  §20, 283-295 
1 

C%  101 , 102 
Ce(A)  245 
Ce(A)**  284 

La.m  36,251 
315 

LJ  ^-compact  cardinal  315 

A -additive  measure  23 

^-complete  filter  3 

^.-complete  ideal  3 

A-dense  ideal  470, 471 

^-saturated  ideal  277,  §16,  §17 

Lebesgue  measurability  72, 13, 61, 133 

- from  Woodin  cardinals  461 

- in  L 169 

- in  L[U]  213 

- of  all  sets  from  inaccessibility  132,  136, 179 

- of  all  sets  under  AD  377, 468 

- of  A\  sets  181 

- of  provably  A\  sets  1 80 

- of  (analytic)  sets  148 
-offsets  149,178 

- of  £},+i  sets  under  Det(77 ,J)  380 

- of  weakly  homogeneously  Suslin  sets  457 

- via  games  376 

Lebesgue  measure  (mi)  12, 22, 24 

- algebra  136 
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Left  distributive  law  329 

Length  (of  an  iteration)  250 

Levy  Collapse  (Col(A,  /c))  128, 135, 139, 221 

Levy  hierarchy  5 

Limit  point 

- of  a set  of  ordinals  2 

- of  a set  of  reals  133 
Los’s  Theorem  9, 47,  52, 245 
Lowenheim-Skolem  Theorem  XIV,  8, 19 

- for  40 

Mahlo  cardinal  (see  also  a-Mahlo  cardinal, 

O' -weakly  Mahlo  cardinal.  Weakly  Mahlo 
cardinal)  21,  57,  58,  125 
Mahlo’s  Operation  18,  54,  66, 216 
Martin' s Axiom  (MA)  124,  167,  179,219, 426, 
450, 460 

- for  k (MA(k))  124 

Martin's  Maximxnn  460 
Mathematische  Annalen  1 6 
Mathias  real  745,144,238 
Maximal  Principle  119 
meager  set  73,  373 

Measurable  cardinal  26, 27,  38,  39, 44, 49-56, 
61,  83,90, 126, 179, 195,210-212, 221, 

224, 226-237,  §20, 283-295, 299, 312, 359, 
384-389,434,444,445 
measure 

- on  a tr-algebra  12 

- over  S 23 

Measure  Problem  22, 23,132 
Mild  extension  125,  126 
Minimal  cover  (of  a set  of  reals)  376 
Move  (of  a game)  370 
M-ultrafilter  244-251 

Name  118 

Natural  sequence  (of  structures)  336- 339 

n-huge  cardinal  337-333 
TV-l-saturated  ideal  227 
TV-normal  ultrafilter  227 
Non-rigidity  (of  an  inner  model)  277 
Nonselfdual  class  474,415 
Non-stationary  ideal  over  X (NS;J  3 
Norm  406 

Normal  filter  (see  also  Strongly  normal  filter) 

- over  X 52— 56 

- over  VicS  301 

- over  V(X)  332 

Normal  ideal  over  tc  214 
Nowhere  dense  set  73 
v-closed  unbounded  set  2,  3,  391 
v-Rowbottom  cardinal  88— 92 

v-Rowbottom  filter  218 
v-satisfiable  set 

- of  sentences  36 

- of  L"^  sentences  315 
Null  set  (of  reals)  12 
Number  quantifier  153 


cu-Jonsson  function  379-321,325,347 

“On-definable  set  739 

oil -dense  ideal  470,471 

Operation  (A)  147,  148 

Open  code  737 

Open  set  (of  “X)  377,372 

Order-preserving  map  (for  trees)  160 

Ordinal  definable  set  139,381,466 

Partial  order  (p.o.)  777 

Partial  play  (of  a game)  370 

- according  to  a (resp.  r)  370 

Partition  property  (for  ultrafilters)  346- 349 

Partition  relation 

- infinite  exponent  71,391-396 

- of  all  finite  subsets  5(7-84 

- ordinary  77-77 

- square  brackets  85,  §8 
Payoff  (of  a game)  370,  371 
pcf  theory  97 

Perfect  set  733 

Perfect  set  property  133,  734,  310 

- for  all  sets  from  inaccessibility  132,  136 

- for  all  sets  under  AD  377 

- for  Borel  sets  147 

- for  77/  sets  149,  170,  184 

- for  27/  (analytic)  sets  148, 183 

- for  X/  sets  184 

- for  77(J+|  sets  under  Det(77,J ) 380 

- for  weily  homogeneously  Suslin  sets  457 
-in  L 150, 169,  170 

- in  L[U]  273 

- via  games  375 

P -generic  779 

- over  M 118 

Philosophical  Investigations  480 
77/  sets 

- Determinacy  437-440 

- Perfect  set  property  149,  150,  170,  184 

- Tree  representation  162 

- Uniformization  150,  175-177 

- Union  of  !5i  Borel  sets  163 
77/  sets 

- Determinacy  329, 449, 467 

- Perfect  set  property  273 

- Tree  representation  192,196,201 

-Uniformization  150,177,204 
77)  Singleton  Conjecture  187 

77/  singleton  187 
Play  (of  a game)  370 

- according  to  0 (resp.  r)  370 
P-name  118 

p.o.  777 

Predense  set  (of  conditions)  118 
Premouse,  ZFC-  264- 266 

Prewellordering  118,406 
Prewellordering  property  407 

- for  77/  (a)  sets  408 
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- for  E^(a)  sets  408 

- in  L 410 

- in  L[U]  410 

- under  PD  410 

Prikry  forcing  234,  § 1 8, 259-260 
Prikry  sequence  235 
Princeton  University  30,  31,  114 
Principia  Mathematica  XIII 
Principle  of  Dependent  Choices  (DC)  132, 
142, 152,378,433 

Proceedings  of  the  National  Academy  of  Sciences 
U.S.A.  28 

Product  forcing  121- 123 
Product  measure  (on  K2  (mK))  222 

Product  measure  forcing  222,  223 
Projection  (of  a set  (pB))  147 

Projective  class  403 

Projective  Determinacy  (PD)  379, 406,410, 
421^125,468 

Projective  hierarchy  148, 149 
Projective  ordinal  423, 428, 429, 434-436, 469 
Projective  set  148 
Provably  A\  set  179, 180 

Quotient  Boolean  algebra  213 

Radon-Nikodym  Theorem  225 
Ramification  Lemma  72 
Ramsey  cardinal  81,  84,  89, 126, 263, 275 
Ramsey  property  144, 382, 458 
Ramsey’s  Theorem 

- finite  version  70,  100 

- infinite  version  70, 75,  76 
Random  real  138, 178,  180, 181, 223 
Rank  function  7 

- on  a tree  160 

Real  game  quantifier  430 
Real-valued  measurable  cardinal  24,  80, 210, 
219, 224 

Recursion  Theorem  151,  368, 425 
Recursive  relation  154 
-in  a 156 

Reduction  property  404, 405^107, 4 1 5 

- for  Tl\  sets  405 

- for  Ej  sets  406 

- under  PD  410 

Reflection  (argument)  XVII-XVIII,  30, 3 1 
Reflection  Principle  3 1 , 58 
Regressive  function  2 
Relative  constructibility  (see  L[A ]) 

Replacement  Axiom  (Schema)  XIV,  XV,  2,  58, 
115,440, 477,478 
Root  (of  a tree)  72 

Rowbottom  cardinal  88, 91, 92, 96, 128, 396 
Rowbottom  filter  218,236 
R- set  180 

Russell’s  Paradox  XIII,  XIV 
Satisfiable  set 


- of  Lx //.  sentences  36 

- of  LnXjJb  sentences  315 

Saturated  ideal  277,  §16,  §17 
Scale  177, 417, 418, 430, 431 
Scale  property  475,419 

- for  inductive  sets  430 

- for  Il\  sets  419 

- for  E\  sets  420 

- for  Ey  sets  431 

- under  PD  421 
Schroder-Bemstein  Theorem  134 
S-closed  forcing  460 
Second-order  arithmetic  7 52, 4 1 6 

- in  a 156 

Second  Periodicity  Theorem  421 
Semi-proper  forcing  460 
Separation  Axiom  (Schema)  XIV,  19, 30, 245, 
477 

Separation  property  404,415 
Separative  partial  order  120, 129, 130 
Separative  quotient  (of  a partial  order)  120, 
121 

Set-generic  set  186 

Sharps  110, 188-191, 198-207,420,425,439, 
444, 445 

Shelah  cardinal  46 1 
a -algebra  77 

En  -elementary  embedding  45, 46, 28 1 
En  -elementary  substructure  45 
E\  Skolem  hull  279 
E^  (analytic)  sets 

- Bounding  principle  164 

- Lebesgue  measurability  148 

- Perfect  set  property  148, 183 

- Union  of  N 1 Borel  sets  1 63 
E\  sets 

- Absoluteness  174, 175 

- Baire  property  179 

- Lebesgue  measurability  178, 179 

- Perfect  set  property  183, 184 

- Tree  representation  173, 174 

- Uniformization  177 

- Union  of  ^1  Borel  sets  149, 166, 167, 174 
E\  sets 

- Absoluteness  1 97 

- Tree  representation  192,201,204 

- Uniformization  206 

- Union  of  ^2  Borel  sets  204 
Singular  Cardinals  Problem  722, 217,  308 
Skolem  function  8 

Skolem  hull  (and  argument)  (see  also  E\  Skolem 
hull)  8, 101,  266,  284 
Skolem  paradox  XIV 
Skolem  term  8 

- for  0#  101 

- for  L[U]  266 

- for  0*  284 

Special  *>tree  79,  80 
Solovay  game  387 
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Spector-Gandy  Theorem  422 

Stanford  University  1 14, 244 

Stationary  coding  set  350 

Stationary  set 

-in  A 2,3,17,217 

-in  7 \y  340, 341 

Stationary  tower  forcing  462, 464, 466 

Strategy  370 
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